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ON THE SYSTEM OF CURVES FOR WHICH THE 
METHOD OF MOMENTS IS THE BEST 
METHOD OF FITTING 


hy 

A. L 0‘Tuoix 
National Research Fellow. 

In Mi, in, . Fisher's paper 1 on the mathematical foundations 
of theoretical statistics the following statement is found: “The 
method of moments applied in fitting Pearsonian curves has an 
efficient*) exceeding 80 per cent, only in the restricted region for 
which /dg lies between the limits of 2.65 and 3,42 and for which 
f$ f does not exceed 0.1. It was, of course, to he expected that 
the first two moments would have 100 per e*nt, efficiency for the 
normal curve, for they happen to be the optimum statistics for 
fitting the normal curve. That the moment coefficients^ and 
also tend to 100 per cent, efficiency in this region suggests that 
in the immediate neighborhood of the normal curve the departures 
from normality specified by the Pearsonian formulas agree with 
those of that system of curves for which the method of moments 
gives the solution of the method of maximum likelihood. 

The system of curves for which the method of moments is the 
l>est method of fitting may easily be deduced, for if the frequency 

in the range d* I) eyfar, %Jdx then y must involve 

% only as polynomials up to the fourth degree; consequently 

y-*H**+*i**+ty+*>9* y V 


1 Philosophical Transactions of the Ro\al Society of L f oiiilon, vol 2U 
scries A (1921), p, 355. 



z CUR FES FOR METHOD OF MOMENTS 

the convergence of the probability integral requiring that the 
coefficient of should be negative, and the five quantities 
a, pj, p£ t being connected by a single i elation, representing 
the tact that the total probability is unity/’ It is with these 
curves having a fourth degree polynomial in the exponent that 
t he present paper is concerned. 

The hi st step in the study of this system of frequency func- 
tions is to find an expression for the value of the integral 

J m f * e ‘ **(* * *+P* *** P 3 * +P* tte 


in other wot Us, it is necessary to know how the integral depends 
on the parameter a, p^ , P*' P 3 * * 

Since a depends only on the unit of measure of % it will he 
^ufficienc for the moment to consider 1 . Furthermore a 
linear transformation on x leaves the value of the integral un- 
changed, If we replace x by x^jc^/4 the integral to he considered 
becomes 






dx=kje 


dx where foe 


-A 


-00 


*■ Px t 7 * 0*2- X ) 

Consider now then the frequency curves y* k& v , 

These curves are typically himodal and max be classified accord- 
ing to the numhei and kind of modes. The positions of the modes 
are given by the solutions of the equation :*<£?, that is by the 
roots of the equation 4x S +2px+%*0. The discriminant of this 
cubic equation tells us that there will be three distinct real roots 
and thus two dtstinct maxima with a minimum between them for 
the curve, that is two distinct modes for the quartic exponential 
curve, it -8p 3 >2?q*p<0. Two roots will be real and equal if 
p-t O. The thrde roots will be real and equal if 
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p^q^Q . the three roots being- in the ease of three real 

distinct toots, if two of the root?* ate equal in magnitude but 

opposite ip sign then and the curve is symmetrical with 

respect to the y-axis. h p*Q $■/& there will he one real root 

and two imaginary roots given h\ tlie three cube roots of ^ 

That there will he a real maximum at the value of x given h\ 

the real cube root of ^ is easily seen Mom the nature of the 

cut vc or In consideiing points at values of % on each side of 

this real cube root of $ . 

■x . 

JleiKv the billowing classes of curves and their tcspectivc 

equations will he considered 
hype J . y*ke 


The cuive v\ hich is symmetrical with respect to they- axis 
and has <ml\ one mode, this mode being at x«o t 

,,, ,, , -('x*-<2bX*J _ 

lvpc II : y* Me , b> O . 


The cm\e which is symmetiical with u-^pect to the y-uxis 
and has two distinct modes at x*± 


Type J i I : y- A& 


-6c *-4cx) 


c/ O. 


The asymtnetrieal curve with one real modi' al jc* yFF 
1 \ pe 1 \ : y= kg 


The general type of curse with the quurtic exponent. 
Type I, 

First evaluate the definite integral 


-4 


' e 

-00 


dx “ 6 


e * dx. 
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i/4 ' . . ~3/4 

Let % = / j dx = fl/4jy cfy Then 


Then 



Similarly it may be shtnui that 



4 



x 


a? x ~ O 


since the integrand is an odd 1 unction. 




x s e 





4 -/ *■*<? "* V* - / rf&.frrjj, 

*■ QO 

_ O’ ^ 

**” J e~* ctz-O, »* 3 

~0O 

4, r/ *” * V* •/ rfizzl^ 


S. 
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Hence if the total frequency is unity then 


/- = 








. l£HL 

^ JJ r?#?/ approximately, 


■^3 = //£ * Q 


*4 */*//, -1/4, 






Type II: 

Consider the definite integral 



Y-v* J J 


c/x. 6>0 
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CURVES FOR METHOD OF MOMENTS 


Integrate by parts letting u* 


e and aV- aX Wien 


. 4 r w* tu ‘ v «- 4t r ;** **"*!*. 

-os - <x> 


Xow obviously I a cannot be zero. Hence dividing by I 0 we 
find /» 4jU# -44^ and therefore 


j 4/a^1 M4 "• 

O st ST 

4/i * ** 


( cannot be zero), 


Now that b is known (calculated from the given data by this 
last formula) it is possible m any particular problem to find by 
mechanical quadrature the value of the integral I c to any desired 
degree of approximation. The simple rectangle formula with even 
a small number of ordinates known will give a good approxima- 
tion. 

Return now to the integration by parts just performed. The 
result takes the form 


4 


4*1 0 Ctfo 


or 


d 2 J n 

db* 


-Jb 



O 


which is a Riccati J differential equation. Riccati’s equation is v 


a Johnson’s Differential Equations, p 227 . 
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y*0. It has a solution capable of 

Ctz* dz 

expression in finite form in terms of elementary functions if m 
is the reciprocal of an odd positive integer. In our equation 
a*~J hence no finite solution for the differential equation is 
possible, That is no finite expression in terms of elementary func- 
tions can be obtained for I 0 . The solution of the Ricatti equation 
here is 

* C,fl+g, «*■ -jj— * —67~ + -ff/ J 


_ Sb J , 7-3b* It- 3b 6 . , 

+4W+37 + ~jr + — — 14 — 57 — 

To determine C ^ and Cg we note that when ^*0 then 

r 09 

and that when b * o then ZTT , 

<2 O 0 = & / *r ' 2 

It is worth while to make certain transformations on the 
differential equation 


Let J o * e ^ ^ 1 / . Then 

d^v * 

37 1 ~b*\s. O. 


Let b*/e=t . Then 
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* 


d*v 

at* dt 


- Y* 


o . 


Let v * t f// * us . Then 





a 


Let /«/<* where Then 

' *>* 4 J 2 * *’&+[**’ ft O. 

This last equation is Bessel’s differential equation 8 with 77* 
Hence its solution is 


w m A\Ji(*x)+B\7zfcJ*Aj£ where 

4 4 4 4 

J «& m P7fS»&(*% 7JFP Vw-i/W/-? <*?/ 

y s<3 /V< rr+l+rj-r / 


The above transformations give 


-c 


<? K 


Hence 

I 0 -Cb*/*J 


’/* 

e 




* 



Johnson's Differential Equations, p. 235. 
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Setting: 6-0 in I a we find 3 = ^ ) 


Setting b*o in 


cTJ 0 

ebb 


we find A * 


W 

** r /. i- jr< /iJ 
V7 


Putting in these values for A and 3 we find finally 

i 0 - «? + jj^r* TFt + ; 

( 2 ) 

"**'«**£ tjgtvr,* 4 

It is worth noting, for purposes of computation, that the ex- 
pression (2) converges much more rapidly than the form (1) 
given above, 4 on account of the factoring out of<P^^ In addi- 
tion the series in (2) have the advantage that the powers increase' 
by Winstead of by 2 as in (1). It will be shown presently that 
ordinarily b is less than unity. But even for b*I it will not be 
necessary to go further than the terms involving A ** to get at 
least seven decimal places of accuracy. For b less than one even 
fewer terms will suffice for this degree of accuracy. 


4 The form (1) is obtained immediately if we write 

•2 • • • ■ ■)<**> 
r *0 ^ ' 

assume term by term integration permissible, and make use of the fact 


r & _ / y \ 

already mentioned that / X ^ ~ y 

jJ 


'p±L 



(0 


Cl' RMS H)!i METHOD OF MOMENTS 


From the point of view of the Ricatti differential equation n 


can he -shown that I Q = 


* at 

e 


ctjc 


is the solution of 


■ c£I 0 

db z 


-Zb 


dip 

db 


I c -o 


when the solution R sought in 


the form of a definite integral. 6 For the diffeiential equation 
brffOjw D and gl and ^ are polynomial 
ill & with constant coefficients is satisfied b\ 


cC 

where c is a constant, T~(bJ is the reciprocal of and at and 

fi are so chosen that for all values of fr 

e bt+/*rvwjdi\ * m Q 
L J ^ 


Let /dfZyr~ k. 


Then 


( 2 4 ftj- /*?/, 

J/tft) Tftjctt- JofO* 


and 
Hence 


€ * 




* O for all values of b if *0 
°C 








5 A R. ForS>th's Differential Equations, 6th edition. 1929, pp. 277-280* 
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C 

J j 






Now let and c=- l£F . Then 



An idea of the variation ofZ as a function of 8 can l>e ch- 
ip 

tained from the following table calculated from (1) for values 
of & at intervals of 0.1 from 0 to 1 and using /“} / jfj~3.6£56fO> 


J.J254J IT The results are plotted in the accompanying 
graph, Fig. 1, 


6 


00 

1.812 805 

0.1 

1.945 063 

0.2 

2.099 726 

03 

2.282 225 

0.4 

2.499 648 

0.5 

2761 349 

0.6 

3.079 783 

0.7 

3.471 748 

0.8 

3.960 152 

0.9 

4.576 578 

1.0 1 

5.365 158 


The modes are at £ \/fr * Ordinarily the ordinate at the 
modes will not be greater than e * £ 718JZ times the ordinate at 
X*0 . Hence ordinarily it will not be necessary to consider 
values of b greater than unity. 
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\ 



FIG. I 


I. 


■/. 


05 -(x+- 26 x*), „ 

xe c/x*<Q. 


I z *fz*e 




4 




-r fx*46i6*) 


’e 


dx *0. 


*4 


- op 


7 - /Xi"€t»J Sitl, 

J *» 'J* * ■ ***(£/ 




-£„,/■/ *—‘* v o. „.o,j,43 ~ 
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To find these derivatives one might use the relations 6 

, 5 4 s mt w ■ */ 


But term by term differentiation is permissible and for this pur- 
pose it is simpler to use (1) rather than (2). We find 




. b* 5b* 95b 4 1 3-9-5 b 6 


_ > r (i)\ 


,, 3b s 73b*1f-r-3b 

~7T 




5b* 9 -5b* 

5/ 


'rtf) 


i+ 


3b* 7-3 b * 
& 4/ 


46 * 2 , < 4 '\ 


2! 4/ 



Since b b O hence I„ and all its derivatives are greater than zero. 
Now the total probability is to be unity hence take k ~ 




-Li 

‘ j ■ 


o. 


Type III : 



off. 

Wb 


M i =/ 5 *<3, 



4l a 


etc* 


y* Ae 


-fr* 4cx) 


• Whittaker and Watson, Modern Analysis, third edition, p. 360.- 
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This curve is not symmetrical. But, obviously, changing c 
to - c has the same effect as changing % to -x or simply revers- 
ing the shape of the curve and the distribution from which it 
arises Hence it will be necessary to consider only positive values 
of c , As stated already, it is easy to show that there is a real 
mode at the point given by x equal to the real cube root of c . 
If c=l thcny^A'<? S , that is e 3 times the value of the ordinate 
at x*Q . Hence usually C will not be as great as unity. 



' FIG. II 

(V fe) y s fa -ft* 4 ™) 

Let 1 0 - f 

- QO 

x ^ 4 cX 

Integrate by parts letting u=e’ , c/r=e afx. 

Then 
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15 


1 1 nice 





J o 




With tln.s value <>f c calculated from the qneu data median- 
iial quadrature uni he u-ed to find an approximate value foi f a 
I'lien let 


H a 


X 

To 


The result nt the uilegtatum hv patts could have heen written 
m the form of the dilhrential equation 


ci 3 L 


—3 -6<cl 0 -o. 


e o 

ddmik. unm <>\ the- .solution uf the dilfeiential equation 


/'OO 

l on\u u vl), it i* v t \>y to show that 1 = / e ^ y <dz i,s the 


d 3 '/ 

d'C 3 


64 cy- O. 



lor. hue 

6 *. *<%/•!>* 


and 


^ct- t*/£f6 1 


/& 

-dfoi all values of c if 




i fence 



16 


CURVES FOR METHOD OF MOMENTS 




-A/?"-'** 

M 


* Now let f ** 4 x and c=-I6 . Then 



00 -Air *£ 4cx), 
e ax 


4- 


An expression for the value of f c can be obtained either by 
hading the series solution of the differential equation and deter- 
mining the constants by setting c-O in I 0 and its derivatives, or 
by expanding e 4cz in series in the definite integral itself and 
then integrating term by term. 

r. */*-''**■ ■“*■><& 

'“CO 


4 cx) f 00 -fo*. 4cX ) / 

e c/x + / e (X cx 'dx 


e <fx+ 


e c^x 





-4CX 

+ e 


XcX 


1 *Ja 


e cos /7 {4cxJ c/x 



(4cx)* f4cx)* f4cx) 
J2/ * 4i + 6! 


6 



c/x 
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co 

=£i: 

77 ~ O 


2n 


24cJ 
CSvJJ 


'x*” e 




4 

dx 


1 f ¥s2lZ r CdAdl 

*2».o 72»J/ 4 


Arf 1 ) 

2 ( 4' 


24 cj 4 l 524cJ 
1+ 4-4' 4*3/ 




Ac A 3(4c) 

-/■ . — __ ■+ 


2/ 4 6/ 


3 9'SAcJ U 

4 3 -12/ 

73 ^4c) 
4*JO> * 


These series may be differentiated term by term to obtain the 
derivatives of I 0 and hence 


T / 4cxJ / tfr 

L- / xe &x*± ! ±±o 

7 J 4 a! c 


r f ** x -7* - ■* xxsj /£ ,2 c/*I 0 

T *-J * * «*■& -jf s . 

' - CO 

/ /*!» -7x*4cx) ,-f . Stf 3 ! 

J,-J ** dx-fy 

- oO 

y f** 4 -7A- 4 cx) ,1 )4 <2% 

!*'/*« <** *(4) 
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If a- is replaced by x-\ Vc the effect is to translate the modal 
-value of % to the origin. The equation of the curve then beiunes 

i -U 4 +C 1 x 3 +c z x.*+Cj%. + C4) 

y-j e 

J.Q 


where 


Type IV: 


c x *-4 yc . 

Q - 6 3 /c z , 

Cj= -8c, 


c^= 5 c/c 


y= ke 


+ <$x) 


Consider the definite integral 

/>/! 




If /i^.(?we get Type I. \i p^O J we get Type II. If p*Oi 
<ftO we get Type III. Hence consider now p / Qj q V O, 

Integrate I by parts with u=e F and ctv*e <2& Then 




1 / , , -fii*+px*+qx) 

= J (Nx Px 

J -oo 

4 fZj P ******* 9 *^ _ i£ 


9 j 


xe 


Px, 


~ C0 
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Now divide by I 0 and multiply by , Then 

Begin again with I 0 and integrate by parts, this time with 


gx) 

e and 


ctt'* . Then 


I a - / ('4* *+gxJe 


-Zx *+px z + q*) 


at# 


-00 

-4/ 


4^ / ^ '-fr 4, +p**+$X) 


afx+<?p f 




c/z 




Divide by I 0 . Then 




Now substitute q*-4ju‘ 3 ~£p/Jj in + q/J and wc get 


P~ 




?- • -4p' -yu; 



( 3 ) 
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The result of the two integrations by parts can be written in 
the form of two simultaneous partial differential equations. They 
are 




. dl 0 




dp dq dq 


, - Mo dI ° T n 

4 dp* dp * dq ~ J ° = °‘ 


. . -fr +P*J 

Let o a = / e 

- co 


Then 







-(X 4 -tpX*J 


dx= O. 


r o 



-Cx 4 + px*+ 


- (x 4 -+ px*) -ax 

e e ’ c/x 


* OP 


-s 


L s 


px* J 



(9*J? 


J/ 



dx 



A. L. O'TOOLE 


21 


o° <3*2 * 

•£ hn< * 






a?# 




c-v 


£!% 

dp n 


°* <g* 4 


r v s 
ho Gij/ ***** 




r 2/7+J -fa t **■/?#*+ a* J 


- r-v 

d 9 


&& st 

Eo r ~ J ^ 7TTT)! $*»+*£ 


When the values of p> and <? are calculated from the data 
of any given problem by the formulas (3) then values for Z c , 
I 7 , Ij, , etc. can be obtained by mechanical quadrature. 

For two real, distinct modes ~<3 Y< P < &J • Hence if 
-£<p<Ot\itx\ J, 34 > g then one mode flattens 
forming a point of inflexion with a horizontal tangent at the min- 
imum point. Changing g- to -g. has the same effect as changing 
x to^x and hence g is a component of skewness of the curve. 
If the curve is so placed that the sum of the values of x at the 
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modes and at the minimum point is zero then the equation of the 
curve will be of the form 

y.fre't*****^* 9 *' 


If now we change the scale of & by replacing by then 
we are led to the functions of the form 

y .tr e -**(*******9*J, 


Performing the two integrations by parts, as before, on the 
integral , , 


„00 


/ e 

~0o 




d* 


leads to the relations 


£ * - 40i 9 +SMM**J*%eMp, 

nx i* - *(*4 * m#, ) {A) 


if 


r n -fx** pz* ), , 

7 * <** m ' I atevJ t 


then 


f°tn ~**C*1 *+ jox*+qx ) t 1 . 

L d * " 2 n 6* (*P. a **?)■ 

In particular, 

z ° ’ T ° r °- o)m /* * * c/x=J-g-rC : j) t 
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1 


f* f £ Hi 1 / \ 

\e* &*»& 4 


Mi * / i ■ / A * = °> 

i /i 4 raj 
^- 4 / 4 - 3 - -JyTj ' 

+ 

Ms ~ 




J_ 

< r *e - 4s* 


Msn-S = ^ ^ 


z/ -y /I - 1 CClZjJ . 

M;„ / s„ /J a p’fXj 


in the case of T>pe I when 

-a z x < , 1 rifl ) 

p*<g«0 and hence from (4 J , or as can be shown directly, < 3 e *^~~, 

*A# 


In T>pe II, 

-dV-X^pxV 1 z" d z/M+ ox i) , 
y*y c e t 1= / e d (x * p * J **. 
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9*° 


and hence 



2 PjU £ + 


In the Type III where 


y-y<,* 'y 



**#*■*1*^ 


/>=° 


and hence 






In general, since/? and q are determined when the modes and 
minimum point of the curve are known, theoretically at least, 
is fixed by the relations (4), In practice, however, this would 
mean that the accuracy in the determination of a 2 would be con- 
tingent upon the accuracy with which the modes and minimum 
point are determined. Hence other methods for fixing will be 
required in general. Now if in I(ap / <?^we replace p and q 
by (4) which involve only <9* and Quantities calculable from the 
given data we have a function of a alone, say ffa). It will be 
sufficient then if we determine a value of a such that ffa)* N 
where N is the total given frequency Then fix p and q by (4) 
and the modes and minimum point by 2 px + q - O. 


The points of inflexion are found from the equation 


grjj/ 

cXX ^ 


- O 


and for Type I are given by . Hence 

ys 

approximately. For Type II they are given ky 

* O . For Type III they are given by 16a*x 6 +3di*qx 3 
- O h And in general they are given by roots of the 

equation 


1 6 a *x 6 + 16 <t%x 4 + 64 gr-tr^ *0. 



A. L O’TOOLE 


25 


It will be noticed that the distribution given by 
y.ye’^" 

can have the Mean at the origin if and only if q*0 , that is, 

if and only if the distribution is symmetrical. Now replace x by 

X*m, The area remains the same and hence also V . 

* 0 

The equation then is 

where 

p^6m £ ip t 

P 3 * 

% 

P 4 * *n 4 *m*p-nvq ) 

and p and q are given by the relations (4) above. An integra- 
tion by parts with u»e* ^ shows that 


tW/i+SPiM'i+tfii/i'jf +P 9 Mp*L 



TYPE I 
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This small beginning of the study of the system of frequency 
curves with the quartic exponent will be concluded here with the 
construction of artificial illustrations of Types 1 and 11. 

TYPK T 

r- -prj e '"" ■ 



y 


**y 

o.o 

0.5516313 

, 0,0000000 

0.0000000 

0.1 

.SSI 5762 

.0055158 

.0000551 

0.2 

.5507494 

0220300 

.0008812 

0.3 

S 471811 

, .0492463 

.0044322 

04 

5376888 

0860302 

.0137648 

0.5 

.5182096 

.1295524 i 

.0323881 

0.6 

.4845787 

.1744483 

.0628014 

07 

.4338852 

.2126037 

.1041758 

0.8 

.3662367 

.2343915 

1500105 

0,9 

.2862255 

.2318426 

.1877925 

1.0 1 

.2029338 

.2029338 

.2029338 

1.1 j 

.1275846 

.1543774 

.1867966 

1.2 

.0693579 

.0998754 

.1438205 

1.3 

.0317147 

.0535978 

.0905803 

1.4 

.0118376 1 

.0232017 

.0454753 

15 

.0034917 

.0078563 

.0176767 

1.6 

.0007861 

.0020124 

,,0051518 

17 

.0001301 

.0003760 

.0010866 

18 

0000152 

.0000492 

0001596 

- 1,9 

.0000012 

0000043 

.0000156 

2.0 

.0000001 

.0000004 

.0000016 


5.2758155 

1.6899455 

1 

1.2500000 


To.,1 frequency .IT^rOUSSJ-O.SSiejl - *i.o00Gooo 




111, 6899455)- O. OOOOOOO 

'' io 

2 fl. 1500000) -O. OOOOOOO 


03379891 

<0.2500000. 


JO 










TYPE II 


28 



= 1 . £>=- 5 , <?= 0 . 




A. L. O’TOOLK 


29 


FId7o99 " 


TYPE II 


X 

y 

f - - - 

x*y 

x*y 

0.0 

0.4572267 

0.0000000 

0.0000000 

0.1 

A594725 

.0045947 

,0000459 


.4657175 

.0186287 

.0007451 

0.3 

.4744135 

.0426972 

.0038427 

0.4 

4827888 

.0 772462 

.0123594 

0.5 

.4867153 

.1216788 

0304197 

0.6 

.4808614 

.1731101 

.0623196 

07 

.4594725 

2251415 

.1103193 

0.8 

.4180410 ! 

.2675462 

,1712296 

0.9 

,5556970 

.2881146 

.2333728 

1.0 

.2773220 

.2773220 | 

.2773220 

1 t 

.1936552 1 

.2343228 | 

2835306 

1.2 

.1181056 

.1700721 

2449038 

1.3 

0611958 

1034209 

1747813 

1,4 

,0261429 

.0512401 

.1004306 

1.5 

.0089145 

.0200576 

.0451297 

1,6 l 

.0023433 1 

.0059988 

.0153571 

17 

.0004575 

0013222 

.0038211 

1.8 

.0000638 

.0002067 

0006697 

1,9 1 

.0000061 

.0000220 

.0000795 

2.0 

.0000004 

.0000016 

.0000064 

i 

I 

5 . 22 K 6133 

2 0827448 

1 7706859 


Total frequency 


M, = 




0347448) - 6? OOOOOOO 
lO 

2fj. 770 6659) -O. OOOOOOO 


0.41654996 


0 . 05413756 , 


+ JO 

From relations ( 1 ) or (2) it is found that when b - 0.65 , 


(i.e. p = -0.5,q*0) 

then I a - 6. 187099: Conversely, the formula b = 

gives, retaining six decimal places, b - 0. 650000. 

(To be Continued in May Ishue) 














ON THE LOGARITHMIC FREQUENCY DISTRIBU 
TION AND THE SEMI-LOGARITHMIC 
CORRELATION SURFACE* 

By 

P.vr-Tsj Yuw 


INTRODUCTION** 

The method of treating frequency curves as developed chiefly 
by Kdgeworth, Kapteyn, Van Even and Wicksell occupies an 
important place in both theoretical and applied statistics. The 
essence of this method may be briefly summarized as follows . 

Suppose a function of the variable z is distributed according 
to the noimal law of error. Then, z certainly cannot be also 
normally distributed, unless the function is a linear function of % , 
Without losing generality, we shall write the normally distributed 
function in standard units as X* f(%)* Thus the origin of x is 
its mean and the unit of x is its standard deviation. The relative 
frequency of values of X between x and x+ dx, is, therefore 



and the relative frequency of values of 

%+d* is 

M 


X Ixitween % and 




dx. 


Thus if we have an observed frequency distribution of jg 
and we know a normally distributed function of % , then we can 

*A Dissertation Submitted in Partial Fulfillment of the Requirements 
for the Degree of Doctor of Philosophy in the University of Michigan. 

* * Papers wntteq by the writers mentioned in this introduction are 
listed under the writers' names in the Bibliography at the end oi this paper. 
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graduate the distribution of Z by using this formula. Edgeworth 
calls this method of graduating a frequency distribution the 
method of translation. In two papers on “Skew Frequency Curves 
in Biology and Statistics” published in 1903 and 1910* j. C. 
Kapteyn elegantly set forth a theoretical foundation of this 
method. Later Wick sell gave a similar justification. Both of 
them based their “genetic theory of frequency”, to use WickselFi 
terminology, upon a generalized hypothesis of elementary errors, 

In the present paper, we are interested only in the important 
special case where log The frequency function of 

% , then, becomes ; 

which is called the logarithmic frequency function.* 

Numerous papers have been written on this frequency curve. 
Among the early writers were Francis Galton and McAllister. 
But a systematic treatment on the properties of this curve from 
the standpoint of mathematical statistics is still lacking. Hence, 
in the hist part of this paper, such a treatment will be given, thus 
leading to some interesting relationships among the characteristics 
of this curve. 

Various methods of determining the parameters of this fte- 
quency function have been proposed by writers on this subject. 
Pearson is the first writer to make use of the method of moments. 
Later this method was also applied by Jorgensen and Wieksell. 
In this paper, the method of moments will be considered and a 
table will be provided to facilitate the computation of the constants 
by this method. 

Edgeworth, Wieksell and Van liven all have contributed in 

* For a. justification of this frequency function based on Weber - 
Fee liner's Psychophysical Law see the “Calculus of Observations*’ hy K. T. 
Whittaker and G Robinson, pp. 217-218. ( Blackie & Son bid, Loudon 
and Glasgow, 1929) 
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n 

extending the method of translation to correlation surfaces. Wick- 
sell’s logarithmic correlation surface is particularly noteworthy. 
In the last part of this paper, a semi-logarithmic correlation sur- 
face of two variables will be developed and its properties studied. 

The writer wishes to express his appieciation for the assist- 
ance Professor Cecil C. Craig has given him in making this study. 


PARTI 

THE LOGARITHMIC FREQUENCY DISTKI HUTION 

For the sake of clarity, it is desirable to state at the. outset 
that the logarithmic frequency distribution represented by 


y 


*&rr cfe'd) 




(*°g 




0) 


is unimotlal and has three parameter. The parameter a is the 
finite lower or upper limit of z according to whether t> is posi- 
tive or negative. In the following discussions, unless the sign 
of b plays an important role, we iihall take b to he positive 
and <9 to be the finite lower limit of z . However, the results 
of our discussions can be easily modified to cover the case where 
b is negative and a is the finite upper limit of z . 

In the first eight sections of Part I the properties of the 
logarithmic frequency distribution will be treated from the stand- 
point of mathematical statistics,* and in section 9 the numerical 
application of this distribution will be discussed. 

1. AVERAGES 

We shall first give the analytic expressions of four different 
averages of % and then observe their relative magnitudes. 

*Some topics under consideration here in regard to the properties of 
the logarithmic frequency distribution have also been discussed by many 
writers, among whom we may particularly mention McAllister, Kapteyn, 
Pearson and Pretorius. See the references under these writers names in 
the Bibliogi aphy of this paper. 
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By definition, the arithmetic mean of Z is 

aa C z 

m - f zFfe)dz - be * + a . 

The logarithm of the geometric mean of z about the point 
Z* a is given by 

J Jdgrfe- a) Ffy)dz *logrb. 

Hence, the geometric mean of Z about z * a measured from 

Z” o is 


m^r « b+<* . 


Since the median of z corresponds to log O 

it is equal to 


rr?-/ * b+a . 


JP/b) loa^T-tc* 

Setting the derivative — ~ w F^z) 

dz 


equal to zero, we obtain the mode of z as 


j 

7r? o m oe + a . 


Thus, the geometric mean and the median are equal. More- 
over, 

v * *n 

2. POINTS OF INFLECTION 
The second derivative of F(z ) is 

*g)\**+ Uo 

at 9* ~pT( >T^7? “ J • 
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The roots of the equation 

(log / 3 C * log O 


are the points of inflection of the logarithmic frequency curve. 
We shall denote them by % s and Z 2 , 


k i - ** 
z 4 - be 



+a 


Note that the quantity under the radical sign is always posi- 
tive and greater than one. Its square root is, therefore, greater 
than one in absolute value. Hence, Zj< b+a < * That is, the 

geometric mean and the median of x lie between the points of 
inflection. 

Furthermore, if we observe that the points of inflection may 
be written in relation to the mode as 


Zj - a 
Z 2 ~ 




fa-s/nl, ) 


we see that < n ? 0 < . 

But the mean does not always lie between the two inflection 
points, since 

Zj-d.fw-dJe * 9c y 


Zj - & =r 
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Obviously, Z i is always less than the mean. But when C Z >4/7 \ 
the mean is situated above both points of inflection. 

Now, the relation of the averages and the points of inflection, 
when c*<4/7< may be expressed by the inequality 

< fr?o * 7*4 = m < rr? < 


which holds for almost all ptactical cases, since c * rarely exceeds 
4/ 7 in practice. 


3. HIGH CONTACT 

A frequency function is said to have high contact, if the 
function and all its derivathes vanish at the upper and the lower 
limits of the variable x . We know that the logarithmic frequency 
function vanishes at both the finite and the infinite limits of z . 
It can be easily seen that all its derivatives also vanish at these 


points, if wc make the substitution which will 

throw every derivative of the logarithmic frequency function into 
a product of two factors, one being a polynomial in z ' and an- 

other being e *** where k is a positive integer. Thus, 

* 

it is obvious that all the derivatives become zero, as z* approaches 
* 00 i which correspond to the finite and the infinite limits of % . 
For instance, this substitution will put the first derivative of the 
logarithmic frequency function J?7k)> 


PSTT FM 


into the form 




*'-C* -&*'*+*** 

JeTTCSb* € 


which clearly goes to zero as z* approaches t oa , that is, as 3 
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approaches and infinity. 

The logarithmic frequency function, therefore, has high 
contact. 

4. MOMENTS 

We shall study the practical application of the method of 
moments to determine the parameters of the logarithmic frequency 
distribution in section 9, But at present we must know the rela- 
tionships between the parameters and the moments in order to 
discuss the properties of dispersion, skewness and kurtosis 

First, vve shall express the moments in terms of the param- 
eters * 

moment of % about the point z« is given by 

jUf * fcz-a) s e . 

And we also have the recurring relation ^ * 

The moment of % about the mean is 

/Up 

Consequently, the s-th standard moment of z, is 

Setting $ equal to 3 and 4, we have 

% * C * VC* ScJ + * e *+ 6e c *+ 6) 
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which will he discussed in connection with skewness and kurtosis. 
Note that the sign of c 4 follows that of t , because the sign of 

v M 

the third moment of z about the mean is determined by b . 

Now, we want to express the parameters in terms of the 
moments. It is dear that there is an infinite number of ways to 
accomplish this, since there is an infinitude of moments. But we 
are particularly interested in the expressions of the parameters 
in terms of the mean and the second and third moments about 
the mean. Letting <u) * <? c . we, have 

777* + c? 

yUj * tu-J) 

( 2 ) 

^ -- 

Solving these equations foi the parameters, we hud OJ is the on!\ 
real root of the cubic. 


OJ ^ Sco*-^* ctfj* o 


Hence, the parameters <r , b and a may he expressed as 


Q)jl 



where the sign of b follows that of c^. and The prac- 

tical application of (3) and f4) will be discussed iti section 9, 
We shall now turn our attention to some other properties of the 
logarithmic frequency distribution. 

5. DISPERSION 

The dispersion of % about the mean may be measured by 

** 4 

the standard deviation, * be ? fe -jJ ■ Denote the 
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deviation of % from, the mean m terms of the standard deviation 
by 4* Cz- 77 ?)/& . Then, with the aid of formulae (1), (2) 
and (4). we obtain the distribution of i as 


/Fir c \jr/e ] 



fthcMo (& c *-i )£ 


*,/r* c *+*) 


takes the same sign as , 


We know that tor the normal distribution 50% of the total fre- 
quency lies between the limits 6745 and /=• /• 6 746 , 

Xow, we want to know the si mil ai limits of i for the logarithmic 
distribution Foi that reason, we write 4 directly in terms of 

the normally distributed function 

c * b 




jg- 777 

~ar* 


» 

re z -s) 


re c *-i) x 


( 6 ) 


Placing x equal to 67&3a.x\d +.6745we have at once the limits 

f e -.6745 c~% 2 ) 

*l m 1 

fa**- JJ 7 

, _ fa 

between which 50% ot the total frequency is included, These 
limits are two quartiles and obviously depend on C . It is clear 
that one can also locate other deciles and percentiles of if by using 
( 6 )- . 

An abstract measure of the dispersion is the coefficient of 
variability which expresses the' standard deviation in terms of the 
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mean. For the logarithmic distribution, it is 




a 

77 ? 


3 A 


( 7 ) 


which shows that in a logarithmic distribution the larger c 2 is, 
the greater is the variability. 

It is interesting to note that if we also express the deviation 
of & from the mean in terms of fo-j ) and denote it by 




X- 777 
777- a 


•re c 


we ha\e by (5) the distribution of z // m this simple tovm 


J 



( 8 ) 


0 S Kill I • YFSS 

It has been proposed to tt.se °Cj /■£ ot °ij as a measure of 
skewness of a frequency distribution. Fot the logarithmic curve, 
we have shown that 

=7<? ft? c ** js) 

( 9 ) 

I / 

< >r f & - f J* + 2 J • 


Hence, the absolute value of increases with c, Since c can 
take on any finite value whatever, the skewness of the logarithmic 
curve as measured by ot d can also have any finite value. More- 
over, as we have seen, ^ of the logarithmic distribution can be 
positive as well as negative. 

In Figure 1 are shown four logarithmic curves with , 

and with varying & Various parameters calculated 
from formulae (4) and important characteristics of these curves 
are exhibited in Table I. 
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When c^O ,<^also vanishes,. In fact, the logarithmic curve 
approaches the normal curve of error, as c goes to zero. This 
can be demonstrated as follows With the aid of formulae (4) 

we can write the normally distributed function x « Jog? as 





m c{. 


U*gL (***-!)* 


£ z* 777 zJjjH - 771 J * ( e c ~ 1 ) + m 

~ 2 a c ~ c 


Now, it can be easily seen that 

Jim z * ? m £? 
c+o & 

which is a linear function of % . Hence, the logarithmic distribu- 
tion of * approaches the normal distribution as c approaches 
zero. 


TABLE I 

Parameters and Impoitant Characteristics of the Logarithmic Curves 
with rrf*0, <f*l and Specified & 



2 

1 

4 

-4 

CO 

1.0044 

1.1038 

2.0000 


o 

.0665 

.3143 

,8326 

.8326 

a 

-15.0222 

-3.1038 

- 1.0000 

1.0000 

b 

14.9890 

i 2,9543 

7071 

- .7071 


- ,0332 

-.1495 

- .2929 

.2929 

m e 

- .0991 

- .4274 

- .6465 

.6465 

*1 

-1.09 

-130 

- 9341 

.9341 

*4 

.90 

.50 

- .0532 

.0532 

D 

.0666 

.3221 

1.0000 

1,0000 

* t 4 -3 

.0712 

1.8295 

35.0000 

35.0000 
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Another measure of skewness is defined by Pearson as 




m- rn a 


For the logarithmic curve, it becomes 


^ 1 

(*>-!)* 

which has a maximum value equal to . 6$6J \ when a>= l.ZXOX 
and J57J This, however, does not indicate that the skew- 

ness of the logarithmic curve is iimited. Rather it shows that X 
is not a satisfactory measure of skewness, so far as the logarith- 
mic curve is concerned For any measure of skewness should 
characterizx the skewness of a curve without ambiguity, and X 
fails to do so in case of the logarithmic curve. For instance, 
when we say that a certain logarithmic curve has X^.32. we 
may mean either a logarithmic curve with oC f * 63 or one with ' 
^* 3600 . 

When the logarithmic curve is only moderately skew, X ap- 
proximately equals This can he shown as follows* Letting 

h -i, we have * 

and 3h + h 3 * 

Hence, tor small \/?\ and hence small |«*j| X approximately 
equals ^ /?. For instance, when .£ , X « OS7 £/which 
is approximately J 

We may mention here that for the Pearsonian type 111 curve, 
the relation qCj //? always holds. In fact, it appears from 

Table II that the type III curve and the logarithmic curve are 
very similar for small |«^j. But the differences between them are 
already pronounced for , as we can see from Table IIL 
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TAHI.K II 

Ordinates and Areas nt the Logarithmic Curve and the 


Prnrsniitnn Tipe III Curse 
)/7*0 0" a 1 


z 

Ordinate at # 

1,0# luive T>pe III 

A tea from the Lower 
Limit to % 

Log Curve T\peJIJ 

- 3 5 

,1)003 

.0002 

1 

.0000 

- 30 

(1020 

0020 

0005 

0004 

- .>5 

0)24 

0125 j 

0034 

0034 

- JO 

.mi 

04' >2 

0172 

0171 

- i 5 

. R>7 

,1241 

0607 

.0607 

- J 0 

2587 

2501 

,1579 

!S#2 

-* 5 

3002 

3687 

.3178 

3172 

p 

,399] 

3980 

5132 

,5133 

,5 

536/1 

.3364 

,7006 

.7002 

1 0 

,2267 

2267 

.84)8 

t8417 

1 5 

1242 

1245 

.9285 

9284 

2,0 

,056# 

0567 

,9720 

.972) 

2 5 

0217 

,0217 

.0906 

9006 

3.0 

1KI72 

0071 

9972 

9973 

3,5 

0020 

.0020 

9991 

,0993 

4.0 

0(KV. 

.(KKJ5 

.9998 

.9998 

4.5 

0002 

.0001 
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TABLE HI 

Ordinates and Areas of the Logarithmic Curve and the 


Pearsoman Tvpe III Curve. 
rr?*0 °Cj 9 1 


X 

Ordinate at % 

\ 

Log, Curve Type III 

I 

I 

Area from 

Limit 

Log. Curve 

the Lower 

to % 

Type III 

'2.0 

i 

.0084 

0 

.0009 

0 

- IS 

.11% 

1226 

.0259 

.0190 

- 1*0 

.3364 

3609 

.1398 

.1429 

- .5 

4408 

,4481 

3442 

.3528 

0 

4040 

3907 

.5624 

.5665 

.5 

2m 

,2807 

.7363 

.7345 

1.0 

1791 

1785 

.8520 

.8488 

IS 

.1017 

.1045 

mo 

.9182 

2.0 

.0548 

0573 

.9590 

.9576 

* 2.S 

.0295 

' .0300 

9785 

9788 

3.0 

.0144 

,0151 

.9895 

. 9897 

3.S 

.0073 

0074 

.9948 

.9951 

4.0 

.0036 

.0035 

.9977 

.9977 

" 4.S 

,0017 

0017 

9987 

9990 

5,0 

.0009 

.0008 

.9993 

.9995 

5.5 

.0004 

.0003 

,9997 

.9998 

6.0 

,0002 

.0002 

9998 

.9999 

6.5 

.0001 

.0001 

.9990 
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7. KURT OS IS 

Another important characteristic of a frequency curve is 
kurtosis measured by g ( 4 4 - 3) or simply by rp - which 

equals zero for the normal law of error. If the mean and the 
standard deviation are taken to be the origin and the unit, respec- 
tively, then usually the frequency of a curve in the vicinity of 
the mean is in excess or in defect to that of a normal curve ac- 
cording to whether rp is positive or negative. A curve is said 
■ to be platvkurtic, if yp > O . It is ieptokurtic, if rp < O. Thus, 
the logarithmic curve is always platvkurtic. for its rp is 

/p m fco-J)f cu ^ 3co *+ 6cj+ 6 ) 

no) 

or rp=*co*+£co s + 3 co 3 - 6 

and <o> J Since the logarithmic curve has only three parameters, 
there exists a functional relationship between its skewness and 
kurtosis. This relationship is given through the parameter to by 
(9) and (10). We may further deduce the following relations 
from these two equations : 

rp is always greater than jr ^ . This follows from the fact 
that £ - 3«c 3 » ('co - J oj 3 + 3 co > C?. 

For|<^ |< 6,4 4 \ we have > rp , since 

3 - r? * { aj - I ){- co 6 co + 6 > ) > O holds, provided 

co<4,<3 . which corresponds to | <*^j < <5*. 44 1 
Kor \« 3 \<zis . we have .since 

2 - >? co~J JC- ' to 3 - CO £co + zj > O 

holds provided co< 44 , which corresponds to \ 4 3 \ 

Since practically the value of Jc^ Jean hardly reach 0.44ot 
even the relations just stated hold for all practical instances, 
The relationship existing between rp and is sometimes 
used as a criterion for applying the logarithmic curve to observed 
data We shall discuss this point in section 9 
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8, POWERS, ROOTS AND PRODUCTS OF THE LOGA- 
RITHMICALLY DISTRIBUTED VARIABLES 

If % is logarithmically distributed and has "a* as its lower 
limit. W « also so distributed, k being any constant. 

This follows from the fact that if is normally dis- 


L 

tributed, .so is to •* g leg? . From the frequency function of 

Z given by (1), we find at once the analytic expression 

of the f requeue) distribution of W to be 


l 

J?T7 c/rlV 


l 

e *c*k* 



( 11 ) 


We have learned from the preceding sections that a logarithmic 
distribution represented by (1) with larger c has greater varia- 
bility, skewness, and kurtosib. Thus, if k * > 1 y the variability, 
skewness, and kurtosis are greater for IV than for z . On' the 
other hand, if k*< I , the distribution of z has greater varia- 
bility, skewness, and kurtosis. 

If the logarithmically distributed variables % u z n 

are independent and have for their lower limits, a 1 , < 2 ^ 
then the product 

is also so distributed. This follows from the fact that if 




rijw- 


9 A v 


are each normall) distributed and are independent, their sum also 
obeys the normal law of error. 

Since the variables are independent, the frequency distribu- 
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tion of these n variables is represented by 


fz n )ata? 1 ctZj ... c/z n (12) 


where = 


J 


Jtncsrzi - ■> 


~A* fa 


Substituting *,-<?, = -{z n -a n ) in (12) and integrating 

the resulting expression with respect to Z Zj ••**, )€ n successively 
over the respective ranges, we have the distribution of V as 

i fa~5^7. J (13) 

&rr ~^Uc*Y dY 

Since the sum, C ^ • • • ■ + c* , is greater than any cj? , 

the distribution of Y has greater variability, skewness, and kur- 
tosis than that of each individual variable. 

9. NUMERICAL APPLICATIONS 

Many methods of fitting a logarithmic frequency curve to* 
observed data have been proposed. But only the method of mo 
ments will be considered below.* 

The method of moments is very simple to apply. Jt consists 
of placing the computed moments in equations (2) and then 
determining the parameters by solving these equations by formulae 
(3) and (4).f The only step of computation which requires 
spme time and care to obtain accurate results is the solution of 

* Among other methods of graduating the logarithmic frequency dis- 
tribution, the graphical method proposed by Kapteyn and Van Uven is 
especially useful. For a description of this method, refer to their paper on 
“Skew Frequency Curves in Biology and Statistics, 2nd Paper*’. 

fin his paper, “On the Genetic Theory of Frequency", Wicksell also 
showed the application of the method of moments to the logarithmic fre- 
* quency distribution. However, he found the parameter “a” first and them 
proceeded to obtain “log b" and "c”. 
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the cubic. 


*co + 4 ) ~0 

Hence, it is desirable to have a table which will piovide an approx- 
imation of the required root of this cubic for a given Then, 

the root can be approximated to as great it degree of accuracy 
as we wish by applying, for instance, Newton's method. That is 
why Table IV is constructed. Practically, after we obtain an ap- 
proximate value of co from Table IV, one single application of 
Newton’s method will almost invariably suffice to give us a value 
of co accurate to four decimal places. In Table IV, values ui c 
corresponding to given values of cj are also provided to serve 
as a check to our computation of c by formulae (4). 

TABLE L\ 


Tabic Facilitating the Solution ot the Cubic 
faf + 4 )« O 


co 


c 

L « 

wai'.BT — ;tt i„ 

<3 

c 

1. 

0 

0 

__ 



1.01 

3010 

1000 

1 1 

1 XX>23 

480 7 

1.02 

4271 

1407 

wmm 

1,6991 

,4889 

1.03 

.5248 

jfPfPg 


1.7356 

,4969 

~ 1.04 

.6080 


1 29 

1 7717 

5046 

1.05 

.6820 

2209 

1.30 

1,8075- 

5122 

1.06 

7495+ 

.2415 — 

1.31 

1.8429 

5196 

1.07 

.8122 

2602 

U2 

1 8781 

5269 

1.08 

8712 

.2775” 

1 33 

1 0129 

5340 

1.09 

.9270 

2936 

134 

1.9475+ 

.5410 

uo 

,9803 

.3087 

1,35 

1.9819 

5478 

Ul 

1.0315- 

3231 

1.36 

2,0160 

5545+ 

1.12 

1.0808 

3366 

1.37 

2 0499 

.5611 

1.13 

1.128S+ 

3496 

138 

2.0830 

.5675+ 

1.14 

1.1749 

.3619 

1,39 

2.1171 

.5738 

1.15 

1.2200 

.3739 

1.40 

2.1503 

.5801 

1.16 

1.2640 

.3852 

1.41 

2 1 835- 

,5862 

1.17 

1.3070 

,3962 

1.42 

2.2164 

,5922 , 

1.18 

1.3492 

.4068 

143 

2 2492 

.5981 

1.19 

1.3905“ 

.4171 

1.44 

2.2818 

.6038 

1.20 

1.4311 

.4270 

1,45 

2.3143 

.6096 

1.21 

1.4710 

.4366 

1.46 

1 2.3467 

1 .6151 

122 

1.5103 

.4460 

1.47 

2.3789 

.6207 

123 

1.5491 

.4550 

1,48 

2.4110 

.6261 

124 

158) 

.4638 

1,49 

1 2.4430 

.6315+ 

1 25 

1.6250 

.4723 

1.50 

2.4749 

| .6368 








PAE-TSl YUAN 


49 


To illustrate the use of Table IV and to help in studying the 
application of the logarithmic frequency curve, we take the dis- 
tribution of the weights of 1,000 female students from the 
“Synopsis of Elementary Mathematical Statistics”* bv Miss B. L. 
Shook. (See Table V.) 

The mean, standard deviation, and skewness for this distri- 
butionf are 


m* i/d. 74 lbs, 
4* J6 91752 /bs. 
. 976424 


To computer, we find from Table IV that for 976424 
o> is approximately c O 0 * 1JO- For a better approximation, we 
apply Newton’s method * 


cO*o) e 







3co 0 *-foi + 4 ) 
3 coj* t Sat 

w o 


• 1 . 10 - 


.007596 

10.73 


1. 10~. 000743 


• 1. 090 757 


Jy formulae (4), the parameters c , b and a are found to be 
» . 307 627 
b* 51, 2160 /bs. 

<s* 65 0423 lbs. 


* Annals of Mathematical Statistics, Vol, I, No, 1 (1930), p. 39. 
t Sheppard’s corrections have been duly applied. 
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TABLE V 

Observed and Theoretical Distributions oi the Weights of 
1,000 Female Students 

(Original Measurements Made to Nearest t/10 lb.) 


Class 

Limits 
( Pounds) 

Observed 

Frequency 

Theoretical 

Logarithmic Distribution 

Theoretical 
Type III 
Distribution 
Bv Areas 

By Areas 

By Ordinates 

70- 79.9 

2 

0 

1 0 

0 

80- 89.9 

16 

10 

! 6 

4 

90- 99.9 

82 

97 

94 

102 

100-109.9 

231 

228 

| 234 

288 

110-119.9 

248 ! 

255 

259 

250 

120-129.9 

196 

190 

! 190 

| 184 

130-1399 

122 ! 

114 

in 

111 

140-149.9 

63 

57 

57 

59 

150-159.9 

23 

27 

27 

29 

160-169,9 

5 

12 ! 

12 

13 

170-179.9 

7 

6 

6 

6 

180-189,9 

1 

2 

2 

3 

190-199.9 

2 

1 

1 

1 

200-209.9 

1 

1 , 

1 

0 

210-219 -9 

1 

0 

0 

1 0 

Total 

1,000 

1,000 

1,000 

1 1.0(H) 


Knowing c , b and a , we obtain the geometric mean and the 
mode : 

Vi mg * 116. 2563 Jbs. 

* 111, 6J£6 lbs. 

Using these parameters, the theoretical distribution of the 
weights of 1,000 female students has been computed and is shown 
in Table V and Figure II. The fit of the logarithmic distribution 
to the observed data is, indeed, excellent.* The lowest possible 
weight of female students, according to the theoretical distribu- 
tion, is 65.04 pounds, which is just about what one would expect 
after examining the observed data. 

Miss Shook*f the type III distribution to fit the same 
set of observed data /and gave the result as shown in the last cob 


* Grouping the first three classes into one class and the last six classes 
into one class, we apply the X*test for goodness of fit and find that the 
probability to get a worse fit is .70. 

t Annals of Mathematical Statistics, Vol. I, No. 3 (1930), p 242. 
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umn of Table V. The fit is not as good as that given by the log- 
arithmic distribution, especially in view of the fact that the type 
III curve fixes the least possible weight at 84.09 pounds, while 
as a matter of fact there are two students whose weights are 
below that limit.J 

From the standpoint of the method of moments, a criterion 
for the logarithmic distribution to fit a set of observed data is 
that T) * <4 - 3 computed directly from the observed data must 
be approximately the same as the theoretical 9 computed from 
formula ( 10 ). This criterion, however, does not seem to work 
in practice. For instance, for the distribution of the weights of 
1,000 female students, the theoretical *9 is 1.7419, while the ob- 
served 9 is 2.4536. But in spite of this fact, the observed distri- 
bution, as we have seen, is very satisfactorily fitted by a loga- 
rithmic distribution. 

Another criterion is to require the observed moments about 
the lower limit V' to satisfy approximately the recurring relation 

telle* 

£ M's., 

for 4 , This criterion is approximately fulfilled by the distri- 
bution of the weights of 1,000 female students, for which we have 
Ml -14 72 79 'lO* 

- 146696 

and - 1. 0040. 

The fact that a set of observed data may be satisfactorily 
graduated by the logarithmic distribution but fulfills only the 
second crite rion may be explained on the ground that the com- 

t In fact, since the finite limit of the variable for type III curve is 
777 " and for the logarithmic curves w?- cr, the finite limit 

is always greater in absolute value for the logarithmic cur\e than for the 
type III curve 
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paratively wide discrepancy between the observed and theoretical 
frequency in the classes near the lower limit maks a great differ- 
ence in the fourth moment about the mean but does not make 
much difference in the fourth moment about the point k a H . 


PART II 

THE SEMI-LOGARITHMIC CORRELATION SURFACE 

Suppose that the correlation surface of the functions, x^ffu^) 
and is a normal correlation surface and each has its 

mean as the origin and its standard deviation as the unit, Then, 
the probability that values of x will lie l>etween x and x+dx 
and values of y between y and y + dy is 

■ / 

$fcy)dxdy* € L J dxdy. (1) 


]t follows that the probability that values of u will lie between 
U and u+du and values of v between and v+dv is Ffu,v) 
dudv given by 




I 






2rr/Fr* 


dt df 
du dv 

$£ $£ 

DLL OV 


du dv . 


(2) 


Ffu v) is, therefore, a generalized correlation surface of two 
variables, deduced by extending the method of translation for 
treating frequency distributions of one variable. 

It is clear that in this general form the correlation surface 
represented by F(u, v) is of little practical use, on account of its 
complexity. Now a natural simplification suggests itself. That 
is to take x as a function of u only and y as a function of v 
\ 
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oil 


ly. By virtue of this simplification, F{u l v) becomes 


F(u t yJ- 


'* J y &? ‘ 


_i 

’ifi-r*) 






(3) 

da <aV 


which is a great deal easier to handle than before. 

Professor Wickseli has made use of (3) for the social case 
where, in our notations, x and y are 


X* 



U~<2, 

4 



which leads to the so-called “logarithmic correlation surface' 1 * 
The surface possesses the property that its marginal distributions 
as well as the distributions of a for given values of \/ and dis- 
tributions of ✓ foi given values of u are all logarithmic fre- 
quency distributions. 

Presently we shall study another case for which 



The correlation surface Ffot, i/J given by (3) then becomes: 


F'fa+v'J* 


Jrr A c fv~ aJ/T~ri 


(4) 


* In Wicksell’s paper, "On the Genetic Theory of Frequency”, the 
theory of the logarithmic correlation function is developed. In his two 
successive papers quoted in the Bibliography of this paper, the original 
theory is extended and the application of the extended results illustrated. 
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which may be appropriately called a semi-logarithmic correlation 
surface. We shall investigate its marginal distributions, moments 
and regression curves of the characteristics. 

1 . MARGINAL DISTRIBUTIONS 

Now, we shall first find the distribution of the marginal totals 
of u . This can be, of course, accomplished very easily by inte- 
grating with respect to 1/ over the range from a to 

infinity. The result is : 

r 00 1 -h*C a -*'P 

/ T^fu. \s)ctv = e (5) 

^ /ZTJ, 


Thus, the marginal distribution of u obeys the normal laws of 
error. 

■Similarly, if we integrate Ffe, y') with respect to u over the 
range from - 00 to 00 , we find at once the marginal distribution 
of ^ as follows: 


J Ffu, i/)du- 


J 




0<%> 


y 

6 / 


^rrc(^a) 


( 6 ) 


which is, clearly a logarithmic distribution and. therefore, has all 
the properties and characteristics discussed in Part I. Hence, the 
semi-logarithmic correlation surface is characterized by the fact 
that one marginal distribution is normal, while the other is loga- 
rithmic. It is needless to mention that this does not constitute a 
sufficient condition for a correlation surface to be a semt-logarith- 
mic correlation surface defined by ( 4 ). 

2. MOMBNTS 

The moment, /j'y , of the semi-logarithmic correlation sur- 
face about the point u = ? and is given by 

A*!/ • f /, fu-V fa <*) 

— CO & 

('j‘)Orr/' k /L- t k e~*dt 


( 7 ) 
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where ^ 


■“'W' 


if k is even 


if k is odd. 


Using relation (7), we can easily calculate the following six 
moments about the mean of u , rr ? u , and the mean of U" , : 

/u aj * 4 + <s)=0 

- i>*e c *fe c *-l) (8) 

6' r e* c *re c *-Jj*fr c *+SJ 

Atjj* rAcbe* 

Now, we want to solve these equations for the six param- 
eters. As before, we let co~ e c * and write * /U as //U^ Q ^ 
Again, we have u) as the only real root of the cubic. 

+4)^0 . (9) 

The six parameters of tne semi-logarithmic correlation surface 
can be written 'as ; 


A - /7T^ o ** cT u 
c*(Jog? coj? 

h /./ )ifi \i 

d *(&J toy <* Y 


rl Yf_uj±2_ 
“/ f y 


n v~(yt. i)** 3 ** ^ 

•^// C to ~J ) * 



3 u (1 ogr <*>} * 
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which furnish -us. a simple pmctical method for determining the 
parameters of the semi-logarithmic correlation mu [ace for ob- 
served data. 


3. REGRESSION OP THE MEAN 

First, let us observe that the function Ffat/J may he put 
into the following forms : 


FfuyM 






M-r r > 

TfT 


V-d 


277 /TP cfv-d)A 




2A*' 




1 - , v-d 

E lo $ b 


'-rtf 


Hence, the distribution of a for a particulai array of y is noimal . 


0 fu, —d — e ^ r 


V\_A ~c Jo § r ~b~\ 


( 11 ) 


and the distribution of v for a particular array of a ib loga- 
rithmic ; 


**&>*)* 






~2(1 r*) E Jo £ ir- r ^r] 


( 12 ) 


To find the mean of a for a particular value of ^ , we mul- 
tiply fajv) by u and integrate the resulting expression with 
respect to u over* the range from - a* to ex? . 


u «■ u fuj vj du m ^ Jogf f ( 13 ) 



58 


LOGARITHM IC FREQUENCY DISTRIBUTION 


which is the regression equation of the mean of a on 1 / and 
may be called the logarithmic regression equation. 

Similarly, the regression equation of the mean of on a is 
found to he* 

[Ze 4 fu t cr ^ ( 14 ) 

J d 

which may be named exponential regression equation. 

Observe the following points: 

(a) The regression curves (13) and (14) intersect at the 
point 

y Acr 

" m ~r 

v* be * * n? v , 

(h) When r*0 . the curves become two straight lines; 


U ~ L * 777 a 
£* 

i? s fre * 7r> y 


which show that a is independent of ^ and v is independent 
of u . We can also see this from the expression which 

becomes 




-}(*¥)*_ 




JTrrcrS-etJ 


when r*0 . This is the condition for independence of u and 
v in a probability sense. 

(c) When r*J , these two regression curves coincide, This 
signifies that there exists a complete functional relationship be- 
tween a and i/ , namely : 


V- t _ 1 
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id) we June learned from the studies on the normal 
con elation surface* r is the coefficient of correlation measuring 
die lineat relationship between x* and y = ~ lpgf(v- # ) , 

Thus, it is also a measure of relationships (13) and (14) existing 
between u and v , If we note that r may *be written as 

„ ^ r* c **)t 

<r u c 


- ^ " 






(15) 


we see that r is always greater than a.,, / <r u <r^ , which would 
be the coefficient of correlation measuiing the linear relationship 
between u and u , if we treated the correlation surface of a and 
\/ as being normal. 

The smaller the \ahte of c and K 0J , the smaller the differ- 
ence between r and /J„ /<? u . In fact, we can show, as we 

did for one variable case, that. as c goes to zero the semi-loga- 
uthmte correlation surface approaches the normal correlation 
surface. 

Incidentally » we may remark that the expression (15) is 
convenient for computing r . 


4, REGRESSION OF THU MOMENTS 

Using the well-known formulae for the moments of the nor- 
mal curve of erior about the mean, we can find at once the f-t 1 /? 
moment of <^^^about its mean: 


M$ m » f v) c?v 




S' 




Xa-rV** 


o. 


(16) 


if $ is even 
if <s is odd, 


This is the regression equation of the j-f/i moment of u about 
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the mean on It follows that the $~th standard moment of u 
for a given value of w is : 






•s.-u 


jm 


M SJ2 

i.u 


jSl 

*%{§)! ' 


if J is even 


( 17 ) 


x 


O. 


if S is odd. 


Again, by the formulae given in Part I for the moments of 
the logarithmic distribution, we calculate the vj ^moment of fa v) 
about the point m a * : 


M‘ * f v-a) * fa, v) dv 

a . u-r. s*c*r 


, d s^cr 


V/-rV 


( 18 ) 


And the regression equation of the s ih moment of )/ about the 
mean on u is : 


( 19 ) 


v ■ j 'V- v) s &J fu, vjdv 

The standard moment of v for a particular value of u is, 
therefore, 

M 

' * 3 ;v> 

1 


/ C-J) s 'We 

* ^*o 


( 20 ) 

~ e j j ~s7z ’ 

Having obtained the expressions for the regressions of the 
moments of one variable on the other, we shall now proceed to 



PAE-TSI YUAN 


0\ 


discuss the scedasticity, clisy and synagic* of the semi-logarithmic 
correlation surface. 

5. SCEDASTICITY 

From formula (16), we have the regression of the second 
moment of u about the mean on 1 / . 

= *YS-r*J (21) 


which is the same as in the case of the normal correlation surface, 
except that r now does not measure the linear relationship be- 
tween u and v , Since (21) is free of i/ , the semi-logarithmic 
correlation surface is homoscedastic, so far as the variable a is 
concerned. 

From the standpoint of estimation, we may also interpret 
expression (21) to mean that when we estimate the mean value 
of u for a particular value of p' . the error of estimation will be 
reduced if we use formula (13) instead of the mean of the mar- 
ginal distribution of a . The standard deviation of the marginal 
distribution of u is A ♦ while that of (13) is only ** 
as shown by (21). 

The second moment of i/ for a particular value of u is given 
by (19): 


M. . - * cr ^ 


( 22 ) 


which is not independent of u . So, the semi-logarithmic correla- 
tion surface is not homoscedastic for 1 / . Actually /^, K , the stan- 
dard deviation of the distribution of V for a given a , increases 
with u . 

However, the relative dispersion or relative error for the 

*The term “synagic” was used by S. D, Wick&ell to mean the re- 
gression of the kurtosis. (“The Correlation Function of Type A, and the 
Regression of its Characteristics*’, Kun&I, Svenska Vetenskapsakademiens 
Handlingar, Band 58, Nr. 3; Meddelanden fran Cunds Observatorium, Ser. 
II, Nr. 17, 1917) 
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-distributions of v for different values' of a is a constant, namely : 
„ M 




(23) 


Thus, by using formula (14) to estimate the mean value of v 

for a given value of a instead of employing the mean of the 
marginal distribution of 1 / , we i educe the relative error of esti- 
mation, for the relative error of the marginal distribution is 
The reduction of relative error is much pronounced 
when r' is large. In fact, the greater r is, the greater the reduc- 
tion of relative error and the better the estimation. Hence, r* 
measures the degree of relationships (13) and (14) between u 
and \/ . 

6. CL1SY AND SYNAGIC 

Now, we shall study the clisy and synagic of the semi-loga- 
rithmic correlation surface or the regression of the skewness and 
kurtosis of one variable on the other. 

The skewness anti kurtosis of any distribution represented 
by OsfUjVjas measured by^,^ and r) u * are, of course, 

equal to zero, since it is a normal distribution. But the skewness 
and kurtosis of any, distribution of i/ for particular values of a , 
according to formula (20). are given by: 

„ • r* **■'*■>- Jjtf< M-r % i) (24) 




which are two constants. Since the skewness an<l kurtosis of the 
marginal distribution of v are given by /e c £ ) and 

+3# +<5e +6J , respectively, we may say that 
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the distribution of \s for each array of u has smaller skewness 
and kurtosis, and is, therefore, closer to the normal distribution 
than the marginal distribution of 1 / . And it is more so, when r 
is near unity. 

7. REGRESSION OF OTHER CHARACTERISTICS 

In this section, we shall give the regression of other charac- 
teristics, such as the median, the geometric mean, the mode, the 
points of inflection and the finite limit. 

The regression equation of the median and the mode of u 
on v" are, of course, the same as that of the mean of U on k , 
because normal. The points of inflection of re 

points one standard deviation, i.e., to the left and the right 

of the mean, as this is again a well-known property of the normal 
distribution. 

. The regression equation of the median and the geometric 
mean of u' on u is given by 

- be A +<* 

„ »> 

which differs from the regression equation of the mean or the 
median of u on \/ , only in that the constant factor r is on the 
left member of equation (13) but is on the right member of (26). 

The mode of v for special values of U is 

. (27) 

The regression equations of the points of inflection of v on 
U are given by . 


which are not free of u . 
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Finally, we may add that the finite limit of any distribution 
of t/ for a particular array of u is the same as that of the mar- 
ginal distribution of v . 

8. AN ILLUSTRATION 

For illustrating the application of the semi-logarithmic cor- 
relation surface, we take the correlation table of heights and 
weights of 11,382 school boys between 5 and 14 years of age in 
Glasgow from L, Isserlis's paper, (t On the Partial Correlation 
Ratio”,* We shall treat the height as the variable u and the 
weight as the variable w . Thus, the marginal distribution of, 
heights is supposed to be normal, while that of weights is sup- 
posed to he logarithmic. 

Letting the class marks, 49 inches 1 and 56 pounds, be the 
origins of u and v' , respectively, and the class intervals be the 
respective units, we calculate the moments of this correlation 
surface:** 

m u -- 511861 class intervals 

a a w 763} class intervals 

•fe - 01 T7 

^ * 2. 5093 

m v * - .2064 12 class intervals 

0^ « 2 5781 class intervals 

- .5915 
* o4 *3. 1221 
* 4-. 205 8 75 

from which wc deduce the following parameters by formulae 

UO); 

7 = - 511661 class intervals 

^ * 1 . 7632 class intervals 

<u- 1. 03 T9 
c •' . 1929 

«s *-13.45 
b * 13.00 
r - .9340 


class intervals 
class intervals 
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TABLE VI 

Correlation Table of Heights and Weights of 11,382 School Boys 


between 5 and 14 Years of Age in Glasgow 
(Original Measurements of Heights Made to Nearest Inch; 
Original Measurements of Weights Made to Nearest Pound) 



With these parameters, the correlation surface of heights and 


weights is determined. Now, we shall examine the regression 
curves of this correlation suifaoe 

Inserting the computed parameters in formulae (13) and 
(14), we obtain the regression equations of the mean height on 
weight and the mean weight on height. In Tables VII and VIII, 
we have the mean heights for specified weights and the mean 
weights for specified heights. We see, from these tables and from 
figures III and IV, the agreement the theoretical and observed 
results is very excellent In some extreme classes the deviations 
of the observed values from the theoretical values are more pro- 
nounced. But these classes comprise only a small fraction of the 
total number of cases. 
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Now, we go further to investigate the scedasticity of the 
correlation surface of heights and weights. According to the 
theory, for any particular weight the standard deviation of heights 

should he a constant and equal to ~ J . 3<993 inches. 

This is much less than the standard deviation of the marginal dis- 
tribution of the heights, which is 5.2893 inches. That 1.8893 inches 
is quite close to the observed standard deviations is shown by 
Table JX and Tig are V. 

The theory asserts that the dispersion of weights is not the 
same for different heights. Hut for all arrays of heights the 
relative dispersion or relative error of weights is independent of 
heights, 

TABLE VII 


The Mean Heights for Specified Weights 


*" r ^ Jl “’ 11 1 m 1 11 

Mean Height (Inches) 

Weight (Pounds) 

Observed 

Theoretical 

24- 28 

34.4 

33 2 

29- 33 

36.5 

36.4 

34- 38 

39.3 

, 39.3 

39- 43 

41.8 

41.9 

44- 48 

44.0 

44.2 

49- 53 

46.4 

46.4 

54- 58 

48.5 

48.3 

59- 63 

50.5 

50.2 

64- 68 

52.1 

51.9 

69- 73 

53.2 

53.5 

74- 78 

54,9 

55.0 

79- 83 

56.0 

56.4 

84- 88 

57,1 

57.8 

89- 93 

58.4 

59.1 

94- 98 

58.8 

60.3 

99-103 

60.7 

61.5 

104-108 

60.6 

62.6 

109-113 

61.0 

63.6 

' 114-118 

.. 

64,7 

119-123 

63.0 

6S.7 
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FIGURE m 

REGRESSION CURVE OF MEAN 
HEIGHT ON WEIGHT 

M OHSeVVgD VALUE 


FIGURE 17 

REGRESSION CURVE 0 f MEAN 

weight on height 

u oas erven value 
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WEIGHT (POUNDS) 



HEIGHT (INCHES) 


figure: 7 

CURVE OF SCEPASTlCirr 
OF HEIGHT ON WEIGHT 

* OBSERVED VALUE 


FIGURE V 

CURVE OF SCEMSTtan 
OF WEIGHT ON HEIGHT 

HQtSftRVeO VALL* 
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TABLE VIII 


The Mean sVeig-hts for Specified I-Iemhtb 


Mean Weight (Pounds) 


Height (Inches) 


Observed 


Theoretical 


30-32 

33-35 

36-38 

39-41 

42-44 

45-47 

48-50 

51-53, 

54-56 

57-59 

60-62 

63-65 


29 8 
32 5 

36.4 
39.7 

44.6 

50.4 
57 3 
65 1 

72.6 
81 7 

89.1 

92.2 


26 0 

30.0 

34.4 
39 3 

44.7 

50.8 

57.5 
64 8 

73.1 

82.1 
92.2 
103.3 


According to formula (23). for any specified height, the relative 
error of weights is 7.6%, which is much smaller than the relative 


error of the marginal distribution of weights, which is 

(e c *-2)I « /95%i 


Both the theoretical and observed absolute errors or standard 
deviations of weights for specified heights have been calculated 
and are shown in Table X and Figure VI. The agreement between 
the theoretical and observed dispersions is not as good as for the 
regression of the mean weight on height. It should be noted here 
that theoretically the standard deviations of weights for heights 
over 76 inches are greater than the standard deviation of the 
marginal distribution of weights, which is 12.8905 pounds. 

In interpreting the standard deviations of weights for par- 
ticular heights, we must bear in mind that the' distribution of 
weights for any given height is not normal, but logarithmic. 
Hence, a proper interpretation of the dispersion of weights for a 
given height can be made only with reference to the skewness, 
masured by the third standard moment of weights, which, accord- 
ing to the theory, is a constant for all, different heights. The 
theoretical third standard moment of the distribution of weights 
for any given height, as we shall see later, is approximately .2. 
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TABLE IX 


The Standard Deviations of Heights for Specified Weights 



Standard Deviation of Heights 
(Inches) 

Weight (Pounds) 

Observed 

Theoretical 

24- 28 

3 52 

189 

29- 33 

2.40 

189 

34- 38 

1.91 

1.89 

39- 43 

2.12 

1.89 

44- 48 

1.91 

1.89 

49- 53 

2.07 

1.89 

54- 58 

2.04 

,1.89 

59- 63 

181 

189 

64- 68 

187 

1 89 

69- 73 

1.79 

1.89 

74- 78 

192 I 

1.89 

79- 83 

1.95 

1.89 

84- 88 

218 

1 89 

89- 93 

2 01 

1.89 

94- 98 

186 

1 89 

99-103 

1 62 

1.89 

104-108 

2 34 

1,89 

109-113 

0 

1.89 

114-118 


1.89 

119-123 

0 

1.89 


TABLE X 


The Standard Deviations ot Weights lor Specified Heights 



Standard Deviation of Weights 
(Pounds) 

Height (Inches) 

Observed 

Theoretical 

30-32 1 

4.6 

* 28 

33-35 

45 

3,1 

36-38 

4.0 

3.5 

39-41 

3.5 

3.8 

42-44' 

3.6 

4.3 

45-47 

4.2 

4,7 

48-50 

4.8 

5,2 

51-53 

5,9 

5.8 

54-56 

6.3 

6.4 

57-59 

84 

7.1 

60-62 

12.5 

7.9 

63-65 , 

■ 14 8 

8.7 
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Thus, from Table II in Part I, we find that the probability that 
any weight will be at most one standard deviation abo\e or below 
the mean weight for a given height is 6839 instead of .6826, as 
in the case of the normal distribution The difference between 
6839 ami .6826 is slight but should not be overlooked. More- 
over. the difference would not be so small, if the skewness were 
larger. 

Another thing we must observe is that since the standard 
deviation of weights for a given height increases with height, the 
probability that for a given height the weight will differ from the 
mean weight for that height by, sa>. at most one pound is not 
the same foi all different heights, although the probability that 
for a given height the weight will differ from the mean weight 
for that height by at most one standaid deviation is the same Cor 
all different heights. The former probability is greater for smaller 
heights. 

The agreement between the theoretical and observed clisy and 
synagic is, of course, not expected to be close. Theoretically, the 
distributions of weights for specified heights should all have^ 

and r) v * -J *.09. Five observed values of ^ and are 
shown below: 


Height 

(Inchest 

Observed Skewness 
of Weights 

Obsei ved Kurto-jis 
! of Weights 

Ou 

36“ 38 

22 

8.72 

42-44 

.19 

.18 

48-50 | 

29, 

.79 

54-56 

.12 

1,54 

60-62 

-.93 

*50 
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The rather large deviations of the observed 7 ?^ in the first 
class from the theoretical y v and the observed in the last 
class from the theoretical may be accounted for by the fact 
that only 350 and 69 observations are included in the first and 
the last classes, respectively 

The observed marginal distribution of heights is very sym- 
metric but is markedly leptokurtic, since its ^ is about 2.5093, 
Hence, the fit given by a normal curve is not quite satisfactory, 
as we can see from Table XI. 

The observed marginal distribution of weights is quite skew 
and platykurtic. As shown by Table XII, the agreement between 
the observed distribution and the theoretical logarithmic distribu- 
tion is not very close 


TABLE XI 


Relative Frequency Distribution of Heights of 11,382 School Boys 
between 4 and 15 Years of Age in Glasgow' 


Class 

Limits 
( Inches ) 

Observed 
Relative 
! Frequency 

Theoretical 

Relative Frequenc) 

1 (Normal Curve) 

27-29 


.0003 

30-32 

.0007 

.0020 

33-3S 

.0063 

0095 

36-38 % 

,0308 

.0332 

39-41 

1048 

0846 

42-44 

.1682 

1577 

45-47 

; .1913 

.2154 

48-50 

.1929 

2143 

51-53 

.1681 

.1561 

54-56 

.0980 

.0831 . 

57-59 

.0317 j 

.0324 

60-62 

.0061 

.0092 

63-65 

0011 

.0019 

66-69 


.0003 

Total i 

1.0000 

1.0000 
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TABI.E XII 


Relative Frequency Distribution of Weights of 11,382 School Boys between 
4 and 15 Years of Age in Glasgow 


Class 

Limits 
( Pounds) 

Observed 

"Relative 

Frequency 

Theoretical 

1 Relative Frequency 

1 (Logarithmic Curve) 

19- 23 


m j 

24- 28 

0014 


29- 33 

.0119 

.0211 

34- 38 

0640 

0564 

39- 43 

.1297 

.1041 

44- 48 

.1454 

.1441 

49- 53 

.1499 

1609 

54- 58 

,1363 

.1504 

59- 63 

.1139 

,1234 

64- 68 

.0882 

.0897 

69- 7.1 

0682 

.0602 

74- 78 

0461 

0373 

79- 83 

.0213 

.0212 

1 84- 88 ' 

0121 

0127 

89- 93 

.0058 

.0065 

94- 98 

0033 

0033 


.0017 

.0018 


0006 

.0008 

109-113 

.000 1 

.0004 

114-118 


,0002 

119-123 

.0001 

.0001 

Total 

1.0000 

1.0000 


In closing, we may say that the semi-logarithmic correlation 
surface is not at all uncommon in practice, and the method de- 
veloped here for treating it should prove rather useful. In fact, 
our investigation opens up a new way for determining exponential 
and logarithmic regression curves. 
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A SIMPLE METHOD FOR CALCULATING 
MEAN SQUARE CONTINGENCY 

Hr 

Elm ur B, Rovfk 
T he Ohio State University 

If we wish to test for a possible relationship between lwn 
variables which are not quantitatively measurable, but each of 
which has two or more categories, the usual procedure is to make 
a two-way table, giving the frequencies of all the possible com- 
binations, 

Assuming independence between the two variables, a second 
table is built, making the frequencies of each column proportional 
to the* frequencies in the column of row totals. When this is done, 
each of the row frequencies is found to be proportional to the row 
of column totals. 

The deviation of the actual frequency for a compartment as 
found in Table 1 from the expected f requeue) as found in Tabic 
2, is squared and this square is divided by the expected frequency. 
These quotients are summed over the entire table, giving us* Chi- 
square. 

The calculation of Chi-square can be made much simpler by 
simplifying the formula. 

The probability of the occurrence of two independent events 
is the product of their separate probabilities. Thus the probability 
of the joint occurrence of Category 3 of the finst classification and 
Category d of the second classification is the probability of the 
pccurrence Category 3 (which is taken to be the fraction of the 
total number of cases which fall in Category 3), times the prob- 
ability of the occurrence of Category d. The expected frequency 
of the compartment is this product of separate probabilities, mul- 
tiplied by the total number of cases. If we let ^ be the actual 
compartment frequency, f e the expected compartment frequency, 
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f r the total frequency for the row, f c the total frequency for the 
column, and N the number of cases, vve may write, 


Also, 


f * m W'W' N 



(U 


X**£ ( ti~ 
re 

f a 2 ^ ^ 

f 

T e 

(2) 

<e 


* 


Since the table must sum to A/, whether wc have filled it with 
actual frequencies, or with expected frequences, (2) reduces to, 

X=Z£-N 

re 

Substituting for f e from (1), 

z _ 

, X*£-fr -/V , 

w 

-NE&-N 

*r*c 


N 




( 3 ) 
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In order to illustrate our method, we shall compute Chi-square 
for Table 18 on page '86 of Fisher's Statistical Methods for Re- 
search Workers, Our computations are presented in the following 
table 



***, 

cam 

f 

IS 

^4*5 

u 

a ~ 

Ec ^ 

i 

i -- j 1 

* J 

- i 

"C 

Coupling 

88 

82 

i _ 

75 

j ...... , 

60 | 

305 

F, Male, 

7744 

25 3902 

tv 24 
22.0459 

i 562? 
184-126 

| .MX) 1 

1 1 8033 



■■ 

U 

Vl 

■ 

123 

F> Female, 


1156 

0(H) 



i 

m 

<U<!84 

7.2171 

■Ql 


Repulsion 


93 

80 

i — 

130 

418 

F, Males 


KM') 

20W14 

6400 

15.3110 

1 fi l X)0 

40 4306 

| _ ..._.. 

‘ 1 

% 

m 

WBM 

79 

358 1 

Fi Females ■ 

9216 

25 7430 i 

-- . i 

7744 

21.6313 1 

... .. -i 

I 1 

6241 

17,4330 

1 

Frequency 


1 

— ” — 



Total 

337 

29/ 

281) 

290 

1204 

Product 

- ! 


j 



Total 

94.5118 

73 7670 

66 2802 

73.2523 


Quotient 

.280450 

\ — 

248374 

.236715 

.252594 

, 1.018133 


The actual frequency is the first entry in each compartment. 
The square is read from a table and written directly beneath the 
frequency. The reciprocal of 305 is put into the keyboard of a 
calculator and multiplied in turn by each of the squares in the 
first row. The products make the third entries in the compart- 
ments. 
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These products are summed by columns and the sums divided 
by the frequency totals of the corresponding columns. These 

j* -e 


quotients are summed horizontally. This sum is 27 , and can 

T r r c 


be substituted in (3). For our example, 

X-*- 1204 fl. 018133 - 1, OOOOOO) 
*21 332 


This answer agrees exactly with the answer obtained by 
Fisher in his Table 19, page 87. The advantages of this method 
arc two- fold: (J) There is considerable saving of labor; (2) 
with the simplification of calculations, we have greatly reduced 
the danger of errors caused by dropping of decimal places. 



THE AMERICAN STATISTICAL ASSOCIATION 


Vol. IV 


No. 2 


THE ANNALS 

of 

MATHEMATICAL 

STATISTICS 


SIX DOLLARS PER ANNUM 


MAY 


1933 



PUBLISHED QUARTERLY BY 
AMERICAN STATISTICAL ASSOCIATION 


Publication Oific e—Edwards Brothers, Inc„ Ann Arbor, Michigan 
Business Office — 530 Commerce Bldg., New York Univ., New York, N. Y. 

Entered as second class matter at the Postoffice at Ann Arbor , Mich., 
under the Act of March 3rd, 187$. 


STATEMENT OF OWNERSHIP 
UNDER ACT OF CONGRESS OF AUGUST 24, 1912 

Publisher *- American Statistical Association, New York City, New York. 

Bditar~-H. C, Carver, University of Michigan* 

Managing Editor ^ Rattan S. Sekhon, Ann Arbor, Michigan. 

Business Manager — J. W. Edwards, Atm Arbor, Michigan. 

Qwwer^Afcierfcan Statistical Association, 530 Commerce Bldg., New York 
Cityi 



A METHOD OF DETERMINING THE CONSTANTS 
IN THE BIMODAL FOURTH DEGREE 
EXPONENTIAL FUNCTION 

By 

A. L. O'ToouS 

In a paper in this Journal 1 the present writer has discussed 
some of the mathematical properties of a class of definite integrals 
which arise in the study of the frequency function 

(IX y s **+*=,* W a¥a 

This function defines the system of frequency curves for which 
the method of moments is the best method of fitting 2 — i.e, best in 
the sense of maximum likelihood — and this fact gives importance 
to its study, The curves are typically bimodal, the nature and 
location of the modes being given by the roots of the equation 


( 2 ) 4x' 3 +3 / o i x^+ZpjX =. O. 

The first problem which arose was that of finding an expreS' 
sion for the value of the definite integral 


(3) 
If * 


r ^ 

Pi 

is replaced by x - this integral becomes 


(4) 


4 



px.*+<gx+r ) 


dx^ 


*On the system of curves for which the method of moments is the 
best method of fitting. Vol. IV, No. 1, Feb. 1933, p. 1. 

2 R. A. Fisher, On the mathematical foundations of theoretical sta- 
tistics, Philosophical Transactions of the Royal Society of London, vol. 
222 , series A (1921), p. 355. 
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or 


(S) 4- 

_C£ 

kf e where k= ^ 

or, replacing x\ja by x where, a is the positive square root of a* 

(6) 

I 6 = v | t”* (**+**>**+«* 9*>ctx, 

-CSC \ 

or 

<*» 

(7) 

Z a * f e -Cx!*+Px**Qx) dXx 

where 

p “ a P> 


No real loss of generality is incurred in studying (5), (6) 
or (7) rather than (3). For the purposes of the previous paper 
it was found convenient to discuss certain special cases of (7) 
.first, then (7) itself and later (5). Having in mind the practical 
purposes of this note, however, attention will be focused first on 
the form (5) and afterwards on (3). The transformations from 
the expressions obtained in the previous paper are very simple. 
For (5) the special cases studied and a few of the more impor- 
tant results obtained may be stated here as follows : 

Type I : 


P “ <i “ O. 

•-***<<* 

-00 % 

W k f ***• ' a **** x r 0,1 ,2 A. 
00 

* k f X* n * J e \ ° X at* =0. **• 0 1, 2. -3, , 
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u 

i T * 


f . i C!E 

‘ i- arty 


u.-E.o, 
‘ 4 





rfV 

~^r£ 


I 

4a 3 ‘ 


hence 


a‘ 


*z — * 


u *» 


■I& 2 .: 

*o 


o'* raj 


> rr* <2 




■Jln+L _ 

~zr 


O, 


n^O, 1,2,3, 


Obviously, of course, k depends upon the total frequency and 
hence if the total frequency is 



-a*#* 




£rt\[S 

rfi) ' 


This curve has a single mode located at ^^C7and is symmetrical 
with respect to the ordinate at 
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Type II: 

%‘ 0 , p=- 2 b, b> 0 , 


■l 


.Jr / e -(*4-2*6x*) d)i 


0*6* 5-la*6+ 95 ia 6 h 6 t 13-9 51 a* 6* 1 

HT 4 ! 6 ! 3 ! 


*a>rfi)(i. IUAs3l\ ....;] 


If 

Va 




* 


73- 4* 2/ 


) 




aW < 3 * 6 * 
s-4 + 9 . 5 . 4 * e* 


) 


It was shown that this integral could be expressed in terms of the 

Bessel functions and as follows: 

4 4 



r 


*&*/e 


a* 6 * 


\Aj,r^.3jy=^)\ 


where 

tirfjj/yfj 

A* — , 

w 

w _ trfjjrfj) 
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m 


If the total frequency is /V 

N 






This curve is symmetrical with respect to the ordinate at x*0 
and has two real modes located at x = £ 

Type III: p»0. 


I o . ufe- a ^* *+Q x) cJx 

- 00 

/ \~ r *' 4 + a *9*)d» 

\I& / ^ 


= (*!%£* r /Z2+*J 

2i«n*o &»)! 1 4 y 


k 


'rd) \u . . . j 

' ( C 4-4! 4*-6f 4 s -J4/ J 

*?2zC * ?.... j' 

JU 4.6/ 4* JO/ J 


A# 

This curve is not symmetrical and has only one real mode, that 
mode being located at x equal to the real cube root of negative q . 
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Type IV : The general case. 


4- 

~0O 

« JL fe~ x)j x 

a 4 

.*/"•-<**+ p ** +**>**. 


It was shown that the value of this integral could be expressed 
as an infinite series each term of which involved two Bessel func- 
tions. But, as pointed out near the close of the previous paper, 
although this infinite series may be considered a theoretical solu- 
tion of the problem, it does not lead to a simple method of 
determining the constants which appear in the frequency 

function. It is the purpose of this note to give a practical method 
of determining these constants. 

Beginning with (5) 

i o . yC <**f* 4 +P**+f*) dXt 

\ 

the moment is defined by 


a* 





4 

£4 J-oa 


( 8 ) 
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Integrate I 0 by parts, letting u= e~ a ^ X ^and d^d~° ^dx. 
Then 

( 9 ) 4. 

Divide by J 0 and multiply by ^ and the result is 

(10) ^ •-(’4u , s +2pu , 1 J. 

Start again with in the form (5) and integrate by parts letting 
u«e and < 2 V * afx. Then 


( 11 ) I a =/ra*f 

J-ce 

Divide by I a and then 


-/• a x f4u' 4 + 4 pa' s 7 ) 


or 


a^- 


(12) ^ 

4u' 4 +2pu' £ +<2^ 

Now integrate (11) by parts with u = e 

aV. {4* 4 i £px *+ x ) dm. This leads to 

2<> m ^jQ / ('96x a + J£3px 34 '<px s + 40 p *x * 

(13) sf 1* Jf 2) -&*&*+/&**+ 

+ SOpyx** ^ & ax. 

Divide by and obtain 

U~C96u^ +1*8 pu^ +64<2u‘ 5 + 40p*<J± +SOp 9 u$ +2f/uJ) 
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or 


(14) a 


30 


96 u.0 / 138pu' 6 i-84 (ju'j + 40p*u* i-dOp^u^ tld^u^ 
Squaring in (12) the result is 


(15) ^ 


7T? 


64 u ' 4 i Lfpuj + yu}) 

1 


« ■ - ■ >- w — . — ■ ■■» * 

16 u'*+ 4p*u'J+ c£ z a'^J6pu^ u' 4 H3yUjU 4 +4pquy 

Eliminating a * between (14) and (15) the equation 


(16) pf40a^ -I30u' g -BOa^p^POu^ -130*; a^) 
+pfl38u; - 480 ay Ug 184*; -340 uy 4 )4@6a;-48u‘*)* a 

Using relation (10) and 


(17) I6u'*+ 16 pay + 4p*a;* 
eliminate g from (16) obtaining 

(18) JAp*+3£p + 3C -O 


and hence 

(19) 

where 


-£±{F*-Joac 

JA 


A»3u'-du'*JJu/u' -6a'*- 3*y , 

(20) B-90u;uy i -60u?u;-35u'*+J6u;-6Ouy -31uy+60ufu’ 4> 
C -dOa'af-dOu/a/- 43*y +130*',*' aj +13u^ -dOuf. 
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In order to decide upon one of the two values of p furnished 
by (18) notice that, equating the first derivative of the frequency 
function to zero, the location of the two modes and the minimum 
point between them is determined by the roots of the equation 


( 21 ) 4x = O. 

The condition for three real distinct roots in this equation is 

(22) s which requires p< O, 

where ^ is found from (17), If then one of the 

modes coincides with the minimum point. If p*<^ = 0 then both 
modes coincide with the minimum point. 

Extracting the square root in (17) gives two values of 
differing only in sign. Now it is easy to show either by geometri- 
cal considerations or by examining the algebraic manipulations 
leading to (18) that p is independent of the sign of <g . Chang- 
ing ^ to in (5) has the same effect as changing # to - P or, 
that is, reversing the order of the distribution and curve. Also, 
changing x to leaves the even moments unaltered but changes 
the sign of every odd moment. Hence if the value of the function 
at the modal position on the left is greater than the value of the 
function at the modal position on the right then is greater than 
zero. And if the value of the function at the modal position on 
the left is less than the value of the function at the modal position 
on the right then ^ is less than zero. If = 0 the curve is sym- 
metrical with respect to the ordinate at Hence p and ^ 

are determined by (19), (17) and (22), the sign of ^ being 
fixed by examination of the data of the problem or, if necessary, 
by trial. The value of a* is then found by taking the positive 
square root in (IS). Of course (14) would give the same value 
for a * . 

Now that a^ y p and are determined, there remains only 
k to be found. If the total frequency is /V then 
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and hence 


(23) 


/V 


where the numerical value of the integral in the denominator can 
be found by mechanical quadrature to any desired degree of ap- 
proximation. For purposes of the quadrature involved here it will 
be found that the simple rectangle quadrature formula will give 
as good results as could be desired , 8 Having found k then the 
constant r is also known since 

-a*r 


(24) 


'* e 


r* 


i°9e k 

-Q* 


to ?,o k 


^a*7c 


°?< 


/o 


The, points of inflexion are located by equating the second 
derivative of the function to zero. The equation is 

(25) 

If now x be replaced by x + rr> then 



becomes 


( 3 ) 




*On the degree of Approximation of Certain Quadrature Formulas. 
Annals of Mathematical Statistics, vol. IV, No. 2* May 1933, p. 143 by 
A. U O’Toole. 
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where 


(26) 


p f - 4 fT7 
p 2 « 6m £ +p, 

p s = 4m 3 +2rr>p + j., 

» fn^-t m £ p + rrrq + r. 


L* 

The data 4 in the first two, columns of the table given here 
will provide the basis for an illustration of the method described 
above for determining the constants. The numbers in the first 
column are the classes into which the plants were divided. In the 
second column are found the frequencies corresponding to the 
various classes. In constructing the third column the origin for 
X was arbitrarily placed to correspond to the class 25. Taking 

27 x n ffx) 

Zf(x) 




the first six moments and the eighth moment are found to be 

«;« 0.4946903, 

u'= 31.16372, 

4= - 27 09735, 

2212. 973, 

<4= *10.3540, 

u 6 = " 4 ^jJ^~ = 209226- 5, 


4 This data, except for slight modifications, was extracted from that 
of W. L. Tower on the Seriation of Counts of Hays of Chrysanthemum 
Leucanthemum, Biometrika No. 1, 1901-2, p. 313. 
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Class 

Wfim 


y' 

2.5y'=y 

| WBk 

9 


— 

.031102 

.077755 

0 

■91 


m 

.300472 

.751180 

1 

11 


-14 

1,583594 

3.958985 

4 

12 

1 

-13 

4.991068 

12.477670 

12 



-12 

10.246676 

25.616690 

26 

’14 


-n 

14.831798 

37.079495 

37 

15 


-10 

16.280112 

40.700280 

41 

16 



14.482540 

36.206350 

36 

17 



11.089036 

27.722590 

28 

18 

8 


7.712195 

19.280487 

19 

19 



5.108903 

12.772257 

13 

20 

27 


3.359072 

8.397680 

s 

21 

43 


2.269766 

5.674415 

6 

22 

34 

- 3 

1.621764 

4.054410 

4 

23 

32 

- 2 

1.252760 

3.131900 

3 

24 

31 

- 1 

1.062910 

2.657275 

3 

25 

30 

0 

1.000000 

2.500000 

3 

26 

24 

1 

1.046530 

2.616325 

3 

27 

20 

2 

1.214445 

3.036112 

3 

28 

20 

3 

1.547935 

3.869837 

4 

29 

16 

4 

2.133051 

5.332627 

5 

30 

12 

5 

3.108096 

7.770240 

8 

31 

20 

6 

4.654336 

11.635840 

12 

. 32 

20 

7 

6.917724 

17.294310 

17 

33 

30 

8 

9.793409 

24.483522 

24 

34 

21 

9 

12,593310 

31.483275 

31 

35 

6 

10 

13.938151 

34.845377 

35 

36 

6 

n 

12.502565 

31.256412 

31 

37 

2 

12 

8.504388 

21.260970 

21 

38 


13 


10.196442 

10 

39 


14 

mmm 

3.185327 

3 

40 


15 

.238029 

.595072 

1 

41 


16 

.024259 

.060647 

0 


452 


180.792704 















11 13 13 31 39 33 41 X 
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Formulas (20) give 

A — 1409. 786 
7904269, 
O - 16354106 . 


Hence from (19) 

/>- - 202 . 7662 

or 


p~- 21* 48292. 

But p*~2is46292 and the value of g to which it leads do 
not satisfy the relation (22) hence use 262,7662 . Cal- 

culate from (17) and use the positive square root since an 
examination of the data shows that the value of the function at 
the left modal value is greater than the value of the function at 
the right modal value. Hen<;e 


y -29.4284. 

Formula (15) now gives as the positive square root 
a*- O. 0002640. 

Using these values for a*p, the values of the function 

++PX 3 + ) 

are calculated for integral values of x from x* -16 to x. *16 
and tabulated in column four. The constant k is then found by 
dividing the total frequency 452 'by the sum of column four. 
Hence 


By (24) 


hr 452 

792704 


Z. 500100. 


r*- 547Z. 573. 
The function can now be written 


ymZ5e aaat ** > ^* a *-7&****+*9 **6**J taking 2.S, 
or ym eoo 2 * 4 ofr*-J! 02 . 73642*^29. 4234 x - 34 72. 370 ) 
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The values of the ordinates for this function are given in 
column five and to the nearest integer in column six. 

Equation (21) becomes 

4x *- 2)202. 7862) x+29. 4284= O 

which has the roots (approximate) x=40.1, x* 0.07, x-JO.OJ 
It should be noted that the sum of the three roots must equal 
zero. Hence the modes are located at and at x* 10.03 

with the minimum point at x~<003 . These roots can be deter- 
mined to any desired number of decimal places by Homer's 
method. 

If now x is replaced by /K«-25 so that the new values of x 
are respectively equal to the numbers in the class column, the 
function becomes 


. . - ^33t.JS6X + 

y* e 

The modes are now located at x*14,9 and x=3£0J with the 
minimum point at x,*£307. In the figure are shown the original 
distribution and the curve represented by this equation. 



ON THE TCHEBYCHEF INEQUALITY OF 
BERNSTEIN 

By 

Cecii, C. Craig 1 

From Tchebychef’s inequality we’ know that if as, , 
are a set of independent statistical variables with 



* 

** 

- “ °> 


* * • 

. • • . * a* > 


then the probability P that 


- ta < x, *■ Xj + ■ •* * ter 

satisfies the inequality, 

p - ' 

This gives a lower limit for P which is often unsatisfactory. 
Improvement of this result requires further hypotheses. As is 
well-known, Pearson, Camp, Guldberg, Meidel, Narutnt,* and 
Smith* have attacked this problem with considerable success. An- 
other interesting and important attempt in this direction due to 
S. Bernstein seems to have generally escaped attention in the Eng- 
lish-speaking world, at least, since it has been published only in 
Russian -. 4 Because of the latter fact, it seems necessary to give 


‘This paper was written in substantially its present form during the 
author’s tenure of a National Research Fellowship at Stanford University. 

*For references to all these papers except Smith’s and a brief discus- 
sion see Rietz, H. L., Mathematical Statistics, (Open Court Publishing 
Company, Chicago, 1927), pp. 140-144. 

‘Smith, C. D., On Generalized Tchebychef Inequalities in Mathemat- 
ical Statistics, American Journal of Mathematics, Vol. S2, (1930), pp. 109- 
126. 

♦Bernstein, S., Theory of Probability, (Moscow, 1927), -pp. 159-165. 
The present account of this work of Bernstein is taken from a lecture of 
Professor J. V. Uspensky. 
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a brief account of this work of Bernstein’s preliminary to the 
remarks based on it the writer wishes to make. 

Bernstein imposed the condition in addition that 

(1) e(\x i \ k )± e £k! h k -*ik*t, i.1.2, • 

(jE/x ) is read “the mathematical expectation of x.y in which h 
is arbitrary. (This condition is satisfied, e.g., if the x. t s are 
bounded.) and used the following lemma due to Tchebychef. Let 
the statistical variable u be always >0. If then the 

probability Q that u satisfies the inequality, Q- 

Then taking. 



u * e + 



£x, €X, 

= e ' e *■ ■ ■ 

■e ”, 

in which £ is arbitrary, 

era) - ere ex, j £ **j - • - ■ <sve €Xn ). 

Now , , , f 
„ e**f <f 

e + 3! l *. 

and under the condition (1), 




2 

2 £ 

If it is assumed that 


then 

\£\/7£ C< 1 

cWi 

JETt] 

J 

and thus 

(2) 

Efu)< e*°‘ c) 



If in the inequality, u 2 At* , a greater quantity is substituted 
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for A , then certainly 9*7* . Therefore the probability Q of 




me 


satisfies the inequality 


<ps e 


Now 


<f fx.-b *,, + •■ • J£ ) i(l-C) 

u* e ' * n > e 

implies for £> O, 

t 

T* 

*• + *»+“ + *»*e+i7ZT/ 

The value of £ is next chosen so as to make Q a minimum, i.e., 


so as to make? * 


* to* 


£ Jtfl-c) 


a minimum. Thus 


** 2(1-?)** 
f S3 


Then the probability Q that 


7 2 

satisfies the inequality, 


- i 

x, + -z. )* 

n i i~ c J 


if s* Zd-clr* . fs £ , k4 

n t * — — — ;<?<£ with c<i. 

To get the corresponding result for the lower limit of the 
sum x f + + * * • ' + * n , it is only necessary to chooae 

£ < 0 and as before, the probability, Q ' , that 
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satisfies the inequality, 


<?■ 


if also €?• and l£l* Ji with c <1. 

Combining these two results, if P is the probability of 

\ 

then since 

at 

p+ <? + <$'* i. 


Pi l-2e 




£ 


But setting 





and also, 


the condition 






20yJ 

< 7 * 


Tr*< 


c* 

h* 


(Bernstein set € = , using merely the equality sign in this 

condition. The value of c as here given is necessary in the 
author’s developments below.) must be satisfied, or what is the 
same thing, 

2£jP cj 

2cr* s h* * 


Ci 


Ct* +/HO 


from which 



m TCHEBVCHEF INEQUALITY OP BERNSTEIN 

This last quantity on the right is positive and < J as required so 
that the constants can actually be chosen as specified. 

This gives 

cj [<Y < &*+ , 

and finally the probability, P, that 

-CJS *2+ • " X n * 4> 
is such that 

OJ* 

"***+**» , 

or setting co * ter 


(3) Pzl-2e '***£* . 

2 ht 

It is to be observed that generally the quantity —y rapidly de- 

creases with increasing n . 

This is the inequality reached by Bernstein under the condi- 
tion (1). 

„ If all the^-cr are bounded, if, say, always 

I <2 / n > 


one may take h 


3 ‘ 


It is the purpose of the author’s remarks to discuss less severe 
conditions than (1) under which the inequality (3) can be ob- 
tained, These more general conditions are obtained, however, at 
the expense of assuming quite generally satisfied regularity con- 
ditions with regard to the “tails” of the frequency distribution 
of x , which needs not necessarily to be regarded as the sum of 
n component. variables, x,, X z , • • •, . 

If we now take 


H) 


<r x 


U * e 
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we have 


the probability function of A 

4 . 

dFfr)(U£x + ~r-+ ■■■ ) ■ 

The condition (1) insures that the series under the sign of inte- 
gration may be integrated over the interval ^oe> j oa ) . But the 
series can also be integrated over the same interval if it converges 
uniformly in any fixed finite interval, which it does, and if the 

OO 

SeHeS £o & - where 

?« fyj •/ ' 

converges uniformly in the interval (- J . 

Formally, at least, 

( 5 ) £ §;*'•' > 

in which /u^ is the k-th moment about the mean of -u . If 

(6) s jfv"* h* * > hi 2, 

for some h >0 , then for h | <f j i c < 1 the right hand side 

of (5) is convergent and is < / + T^-c) as be ^ ore - Now 

let us suppose that the condition (6) is satisfied not only for 
moments taken over the whole interval f ~ 00 , ot> ) but also for 
moments taken over any interval which includes the interval (~b, b) 
in which b is an arbitrarily large though finite number. This is 
the first rcijultirity condition, mentioned above, which we shall 
impose on the tails of the frequency function of x . 
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Then it is obvious, from the remark above, that 


is uniformly convergent in the interval for \ y\< b for j left- c < 1 
And for* \y \ > t> it is also obvious that for h |£ | * c < 1, 


Z fr> 6 ') 
n*o * 


is uniformly convergent if our first regularity condition is satisfied. 
And since |<f | may be taken arbitrarily small, the inequality (3) 
follows as before. 

It is evident that if our first regularity condition holds, that 
the condition (6) is more general than the condition (1). And 
it is easily seen that this first regularity condition holds for a very 
wide class of frequency functions. For, in order for it to hold, 
it is sufficient that the frequency curve (continuous or not) out- 
side some finite interval f~l> t t>) about the mean as center, be never 
increasing as |^| increases and that if f(x) be the ordinate of 
the frequency curve at the abscissa X . always ffx)> f f-x) or 
else always f(x)< f(~x) for x > h . 

Hut if the first regularity condition be satisfied, then for all 
intervals which include (-b>b) the corresponding moments have 
upper limits in absolute value. And if this be so for all such' 
intervals, the semi-invariants (of Thiele) will also have upper 
limits for their absolute values, if is the k-th semi-invariant, 
we will take for our second regularity condition on the tails of the 
frequency distribution of ^ . that 4 

(7) I \\*T ****'* kiZ 

for some h > a if is taken for any inteival which includes 
the arbitrarily large, though finite, interval 

If this second regularity condition holds, it is again easy to 
show that (5) is an equality if c< 1. The right member 
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of (5) is still uniformly convergent in the interval (-P 3 t>)iox 
J? |<?j- c < i . For all intervals which include f-b t l 1 ) we use 
the formal identity which defines the semi-invariants of Thiele: 


(8) e 








Under the condition {7) y j5(2:J is uniformly convergent over the 
intervals in question for S?\£\z c< 1 and for these values of £ , 
(8) becomes an equality since its second member is only the first 
arranged in powers of £ , Moreover, on account of (7) the right 
member must be uniformly convergent for all intervals which 
include f-b, t>). 

At least one important class of frequency distributions satis- 
fies our second regularity condition. The distributions of charac- 
teristics in samples of A/ have finite ranges as long as A/ is finite 
and they commonly have semi-invariants which are rapidly de- 
creasing with increasing N . If such distributions approach nor- 
mality their semi-invariants of order above the second approach 
zero, in particular they may become in absolute value less than or 
equal to the corresponding semi-invariants of a Pearson’s Type 
III distribution which are given by 


A* ~ (AzJJl 


k > z 


in which 


a * 


ZM 


i or 


>y k - r/r-UJAj f 

Taking h* | it is easy to see that such distributions satis f 
our second regularity condition. The smaller the skewness of th 
Type III distribution, the smaller h may be taken. Thus in sue 
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cases we can give a lower limit for t<j) y the probability 

that \x |< t<f , which is improved with decreasing skewness cf 
the Type III distribution. By the use of the first regularity con- 
dition we could only take h* j as the distribution approaches 
normality. 

As a second application let us suppose that x * x / + in 
which as, and Xg are independent, and in which the semi-invari- 
ants of the distribution of x, are £ * * • and the 

semi-invariants of the distribution of X^ are £ f* 

Then the distribution of x has for semi-invariants 


tt+rfaVt •• •. 

Further let it be assumed that ^ < 1 , and that the distribu- 

tion of satisfies our second regularity condition. 

Then 

, P{\x \s,ta)> Pf\x, j s ia t ) P(\x z \ £ i(a-a,)) 

But 

tfa-O,))* 0 * ) 


Now 


>J-2e 


w J*Jh r/cr- a, ) 


<** 




a -a, fa** a/J 

a z * ** Kl l if o<x<J 


so that we get 

p(\*4 | * tCo-a,))> i-Zi 
This gives finally in such cases 


'iTThl. 

<3* 


P A * k *<*)> P{\x t \st<r)-2e ****** 



ON CORRELATION SURFACES OF SUMS WITH A 
CERTAIN NUMBER OF RANDOM ELEMENTS 
IN COMMON* 

By 

Ca$u. H. Fischb* 

Introduction . The study of correlation due to a common 
factor has been a more or less familiar one in the literature of 
mathematical statistics. Kapteyn, 1 in an exposition of the Fear- 
sonian coefficient of correlation, considered the correlation be- 
tween two sums of normally distributed variables, the sums hav- 
ing H random elements in common. In 1920, Rietz 2 devised urn 
schemata which yield sums with common items involved in such 
a way that the correlation and regression properties can be dealt 
by a priori methods. In a later paper, Rietz 3 considered two vari- 
ables, each the sum of two random drawings of elements from a 
continuous rectangular distribution, with one of the elements in 
common. Here, the emphasis was placed principally upon the 
description of the correlation surface. Some other aspects and 
extensions of this problem were brought out by Karl Pearson 4 in 
an editorial discussion of Rietz's paper. 

In the literature, the theory of correlation has been discussed 
principally in connection with its applications. One of the objects 
of some of the above-mentioned papers is the establishment of a 
closer connection between correlation theory and abstract prob- 
alibity theory. Such a connection would give a more precise 

*Presented to the American Mathematical Society, Dec. 28, 1931, 

C. Kapteyn, “Definition of the Correlation- Coefficient,” Monthly 
Notices of the Royal Astronomical Society, Vol. 72 (1912), pp. 518-525, 

-H. h. Rietz, “Urn Schemata as a Basis for the Development of Cor- 
relation Theory,” Annals of Mathematics, Vol. 21 (1920), pp. 306-322. 

8 H. L. Rietz, “A Simple Non-Normal Correlation Surface” Bio- 
metrika, Vol. 24 (1932), pp. 288-291. 

4 Karl Pearson, “Professor Rietz’s Problem/' (Editorial), Biometrika, 
Vol. 24 (1932), pp, 290-291. 
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meaning to correlation and would tend to make the study of cor- 
relation theory more attractive to mathematicians. With this aim 
in view, the present paper is concerned with correlation among 
sums having common elements, extending and generalizing the 
preceding papers in several ways. 

We shall assume our drawings made from a continuous uni- 
verse characterized by a rather arbitrary law of distribution. We 
shall define r? sums, each of an arbitrary number of elements, 
formed in such a manner that any two consecutive sums have 
elements in common, and inquire into the correlation between any 
two of these sums. The equations of the correlation surfaces 
will be expressed in terms of iterated integrals, the regression of 
each variable on the other will be shown to be linear, and the 
equations of the regression lines will be obtained. The coefficient 
of correlation may then be computed from the slopes of these lines. 

Throughout this paper we shall understand a probability 
function, f( 0 , to be, for all values of / on a range P t a single- 
valued, real-valued, non-negative, continuous function of / . It 
is then Riemann integrable on I? , and we shall require that 

jj f f(t)dt*l. We define the probability that a value of t , 

drawn at random from the range P , lie in the interval (a.b), 

a and b in P and b>a y to be f f(i)dt . We may then say 

j a 

that to within infinitesimals of higher order, the prob- 

ability that a value of t drawn at random lies in the interval 
( t Bachelier 8 has classified probabilities into those of 

the first, second, and third kinds, and Craig® has extended this to 
probability functions, according as P is the range (*>,<*), (O f 00) , 
and (Of a), respectively. We shall find it convenient to adopt 
this classification. 

8 t, Bachelier, “Cakul des Probabilities/' (1912), p, 155. 

°AUen T, Craig, “On the Distribution Certain Statistics/' American 
Journal of Mathematics, VoL 54 (1932), pp. 353-366. 
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I, Sums of elements drawn from a universe characterized 
by a probability function of the first kind. 

1. The correlation between two sums having random ele- 
ments in common. Let fft), a probability function of the first 
kind, characterize the distribution of the variable i . Let the 
principal variable JC t be defined as the sum of 77 1 independent 
values of t drawn at random. Further, let the principal variable 
x g be defined as the sum of /r /£ random values of the r? f values 
of t composing and of 77^ independent random values of 
t taken directly from the universe characterized by ff {)• 

Theorem I. Given the sums x / and X> z as defined above , 
xvith k iZ random elements in common. 

a) The marginal distributions of X, and x^are given,, respec- 


tively t by 

( 1.11 


■ dt „. 


and 


^00 0O 


(1.12) 




Xffa-t,,- -4 






b) The correlation surface, u/* or the simulta- 

neous law of distribution of x f and x* , is given by 

m-tr-t* v'"" '4. ^ ■ 

c) The regression curves of x £ on x t and of x, on x, are 
linear, and are given, respectively, by the following equations'. 

(Ul) 
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( 1 , 32 ) 


when t 




* ^ if ft) eft. 


Hence, the coefficient of correlation between X / and x# is 
n _ 

XfX* ~ {n f r?j)k 

Proof, The proof for the expressions for the marginal dis- 
tributions of X ( and Xj are given by Craig 7 and need not be 
repeated here. The correlation surface w* /Ffx^Xg) is derived 
by a simple extension of the same method to two independent 
variables. 

The regression curve of on x f is the locus of the ordinate 
of the centroid x z of a section of the surface for any given x f , 
Thus 

00 

f f Xj Ftef, 

(1.4) Vr4* - 

It will be convenient in what follows to use an abbreviated nota- 
tion by letting 

( 1 . 5 ) 

which is merely the integrand of the marginal distribution of X, . 
Where no ambiguity can result, & Xi will be used in place of 

Of^x /t Then Ff*?, x z ) may be written 

uy*» ■ t,r ‘"~^s ** */' '”4 ^ * 

X d %o r i 

Now let v .+’l — *, *-**,*, f Caging the variable 
7 Allen T. Craig, loc. cit., pp. 355-356, 
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from to uj (1.4) becomes 


« V \\&&-CCw&L W- - 


h El 




•*t. 



It will be noted that the terms in the numerator fall into two 
groups: those terms containing the factors , f i* 1,2, 
and those terms containing the factors i/ or - 

Further, since the order of integration here is immaterial, the 
equality of the k lst integrals of the first group follows readily. 
Similarly, the equality of the r? z - k /z integrals of the second 
group follows. The expression (1.6) may then be written 


:L7) v {^/- • -f *» < , 



.77 &)*<*,„ , 



In (1.7), it is clear that the integrations with respect to each t$j 
may be effected immediately, making use of £ ’ffvjdv^l 
In the first term of the numerator and in the denominator the 
variable v may likewise be integrated out. The denominator is 
now equal to (1.11), the marginal distribution function of )C , , In 
the second term of the numerator, va f(v) is independent of the 
remaining factors, and J vff 'Jctv is a constant which we shall 
denote by M . This second term of the numerator is now equal 
to (r? 4 ~ k fJZ jM times the marginal distribution function of 
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Hence, we have now reduced the expression (1.7) for x,g to the 
following form: 


( 1 . 8 ) 


’ Ks 1 k >s ) M ' 


where I„ = 


4 / 


jfj * 4 4 > ‘ ** ' ' dt,, 

/' ' '/ ^4), i 1 ,,, • • • <v ; n r n ,/ dt„ 

To evaluate / (? , let t„ *JC,-u-t t3 i, t . 

Then * „ 

j£" X> u > 4 « ' " 4 , « »,-* ^ 

/„•/. **•**• ■■■■ U-' J *Wr- °* u 

' f.t» u u ' ^ ***>, n- y ^*12' a u 

r-r u> *«•’" *•. »n j <**< n r i • • • **« au 

J ’ ^ U ‘ *<*’"' *h n~t^^n~r” °*^2 c * u 

The first term in the above expression for l n is equal to x, . 
Each of the remaining n,~l terms is equal to I r)f . Hence 

I n t m **7 ~ {*/ " l) * 


and 




From (1.8) and (1.9), we have 

+ ( ”*- k, t ) M. 


In exactly the same manner, we may show that 

V 


k, - ^TZ~- + fa - *>* 
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Making use of the fact that in the case of linear regression the 
square of the correlation coefficient is equal to the product of the 
slopes of the two lines of regression, we obtain 

x f** m f 

which completes the proof of the theorem. 

Corollary . If x and y are each the sum of n independent 
random values of a variable i from a universe characterized by 
ffi) , and have k of these values in common, the coefficient of 
correlation between x and y is equal to the ratio of the number 
of values of i held in common to the total number composing 
each principal variable. Thus, r X y * ~ . 

This corollary of Theorem I was proved by Kapteyn 8 for 
the special case of a normal parent distribution of the variable i . 

Illustration, As a simple illustration of the application of 
the foregoing theorem, let us consider the case where 

*/- 5, + K *4 " *// * *S9 With / *m 'ft* > aS 

independent random drawings of / from the Gaussian distribu- 
tion, 

fftj = f s? rr ) * 


From (1.11), the marginal distribution of is 

G-, »'% 

Similarly, the marginal distribution of x. is 

.1 

G^f^hf4rrJ e 

The correlation surface, m/ « Ff'x :, , x, ) , obtained by applying 
' s 

Ffa,, 


TJrr 3iJ 


a normal correlation surface with ^ , 

2. The correlation among three sums, We now proceed to 
extend the preceding theorem to more than two sums. Let us 
define a third sum, or principal variable, , as the sum of k £3 


*J. C, Kapteyn, Joe. cit. 
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elements taken at random from the n £ values of i composing x^ 
plus the sum of independent random values of t drawn 

from the parent population. It is apparent, then, that the mar- 
ginal distributions of ar, , X# , and x s , and the correlation sur- 
faces %) and be formed exactly as were 

those of x f and x:^ in Theorem I, From this theorem, we are 
at once in a position to write the equations of the lines of regres- 
sion and the coefficients of correlation for these surfaces. The 
surface •* /'V- remains to be investigated, as does the 
four-dimensional surface, v*)frC x fi x^,x s ) , which may be ob- 
tained in almost the same manner. 

Theorem II. Given fft)and x t , x 4 , , as defined above . 

Lei be defined as in (1.5). Let 


I'm-?* 1 ' ' ' '*k, 




33 




xffx- i„~- +f- ***„' 

If fftjis a probability function of the first kind , then the ex- 
pression for the simultaneous distribution of x f and x s is 


(2.1) 


' r 'w-£ 




4 ' 3 ' 

* 


• o’*,, 



where by f^Jis understood the number of combinations of c 
items taken a? at a time. 

Proof, Let us temporarily require that , We 

shall show later that this restriction may be removed. The prob- 
ability that x / and ^ as defined contain k 4s -p, fp*Q 


elements in common is 


"(ZjtV*) /(%,)■ 
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The probability of the occurrence of any given pair of values 
fx t , ^ ), that is, the probability of a point falling into a given 
rectangle, , X t + Ax f , x?, * s sum P r °k~ 

abilities of all of the mutually exclusive ways in which it can 
' occur. Each of the terms in (2.1) multiplied by a'x / consists 
.of the integral, (derived by the method of Theorem I), which is 
the probability, to within infinitesimals of higher order, of the 
occurrence of a given pair, f x, * x s ) , with a specified number 
of values of i in common, times a coefficient which is equal to 
the probability of the occurrence of this specified number of 
values of / in common. Each of the terms as a whole, then, is 
the probability that the given f X/ jXj ) will occur with a specified 
number of values of t in common. Hence, the expression (2.1), 
being the sum of the probabilities of all of the mutually exclusive 
ways in which x t and x 3 can fall within the desired rectangle, 
is the probability that this will occur. This establishes the theorem 
when k /£ i 

If k,# < kf 3 , then the maximum number of values of t 
which x 9 and x s can have in common is k i3 . The expression 
for F fx, f x 3 ) in this case, then, consists of the sum of all of 
the terms of (2.1) beginning with the term where x ( and x 3 
have k JZ values of i in common and continuing to include the 
term derived from the case where they have no values of t in 
common. Equation (2.1), however, in its present form may be 
considered as a correct formal expression for the correlation sur- 
face even when k u < , since in this case all of the coefficients 

of the terms where x, and x 3 are to have more than k^ values 
of t in common are zero. This follows from the definition 

(d)* O if c < d * Thus 



Hence, we may now remove the restriction that kj# £ . This 

establishes the theorem. 
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We are now in a position to write down the surface 


It is given by the following expression, where, by ^ 
is meant any ft values of the : 






4 


dt 9rt „ , ***<=# , +/<**/-; * 

4%-' 4%*' 

Theorem III. The regression curves of x s on x / and of xl t 
on x s for the correlation surface > defined in 

Theorem II , are linear and are given, respectively , by the follow- 
ing equations : 


( 2 . 21 ) 

and 

( 2 . 22 ) 


- k /£ k £j *, f* 7 * ** ' 


-V 


JALM-JL 




X. 


k *4 *43 **> 


C"e “ ^/£ ^ ^ 


where M is defined as in Theorem L Further , the coefficient of 
correlation between X, and Xj is 

(2.3) 


*>4 k 43 


r *'*j * n * Cv/ ’iyJt ~ 


Proof . As in the proof of Theorem I, we set up the expres- 
sion for the locus of the ordinate of the centroid of a section of 
the surface for a fixed K t „ We have 

* , '!Z**' ,r * »** )<i ** 

3 '7Z 

where F fx t , x s ) is given by (2.1). From the definition of a 
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marginal distribution, we know that J ^ F'f'x, , x s JdXy reduces 

to (1.11), the marginal distribution of x t . Let us now write the 
expression for Xj as the sum of fractions. Thus 


(2.4) 

9 ’° 


$ fry' 

***** ”s~' «tS' ****,, ' ■ %/9+' < *W / % " d * u ] 




Hereafter, we shall call an expression of the form 



a “probability coefficient.” Then (2.4) is the sum of products, 
each of which is a probability coefficient times an expression 
which is equivalent to the expression for for the simple case 
where Xj would be derived directly from x t by the drawing of 

values of i from jc t . These latter expressions, by the 1 
application of Theorem I, may each be written in the same form 
as (1.3). Hence, (2.4) has been reduced to 



/ 
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k„ 


+ M 


s 49 Z 1 '® 


+s (.** I (*'*“ 

£L *0 l rCjs-pJ l 
By the use of a well-known theorem of combinatory analysis, 0 we 
have that 

in 

. . "fe-'y U,. 

and 



J/* 


have that 7 

and / , , 


Moreover, 





■ mj * £)&■') (zl 


which reduces to 




by the same theorem of combinatory analysis. 
Hence, (2.5) becomes 

j/g 'Tgj **7 


*jr* 


^ fn* >%r ~ K 2 


**f*k ’ n Jt 

In exactly the same manner, we may show that 

fn, n z -k,g kfjJM 


x, s 


k/z 


”z 


We then obtain the coefficient of correlation from the slopes of 
these lines. It is 

' r X,Xj m 7-7 z fr?, r *U *3 ‘ 

This completes the proof of the theorem, since 

k J2 ^ k 42 


*'* m r*zyy* * 

3. The correlation among p sums. We now extend our 
discussion to p principal variables, forming each successive one 


»E. Netto, “hehrbuch dcr CombilMlto^ifc, ,, (1901), pp, 12-13. 
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in the same manner in which and were formed above ; that 
is, #/, "" pJ , is equal to the sum of k^j, i random 

drawings of i from the constituent values of t forming , 
plus the sum of 77/ - fy . 4 / independent random drawings of / 
directly from the universe characterized by The correlation 

surface, w = ##) , can at once be written in the same 

manner as the surface considered in Theorem II. That is, each 
term of the expression for ) > multiplied by 6 

consists of an iterated integral which represents the probability, 
to within infinitesimals of higher order, of the occurrence of a 
given pair, fas*#)* with a specified number of values of t in 
common, times a probability coefficient which represents the prob- 
ability of the occurrence of this specified number of values of / 
in common. This same method may be employed in writing the 
correlation surface for any pair of principal variables. The ex- 
pressions for the probability coefficients, however, become increas- 
ingly complex as the number of ways in which the two principal 
variables can have O f values of if in common increases, 

The following theorem can be proved by mathematical in- 
duction. The proof is not difficult, though tedious, and on that 
account will not be presented here. 

Theorem IV. If f(i)is a probability function of the first 
kind \ and Ffaf, ) is the simultaneous law of distribution of 
X/ and then the regression of on K p and of on are 
linear and are given , respectively , by the follminng equations: 

(3.1) £ pm k Li ki i'": kp : hp X/ + d^£ M , 

n / ^ "* ,m ’ n p-f 

(3.2) Z, *****'"' ‘‘P'lP ^ + - 

Further, the coefficient of correlation between k ( and x, p is 
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II. Sums of elements drawn from a universe characterized 
by a probability function of the second kind. 

4. The correlations between two sums. Let a prob- 

ability function of the second kind* characterize the distribution 
of the variable i . Let the principal variable x t be defined as 
the sum of * 1 , independent values of t drawn at random. Fur- 
ther, let the principal variable be defined as the sum of 
random values of the ??, values of i composing x f and of 
independent random values of 1 taken directly from the universe 
characterized by f( tj. 

Theorem V. Given the sums and as defined above 
vnth ^ random dements in common. 

a) The marginal distributions of x, and x 3 arc given, re- 
spectively, by * 

/ Xf 4 / *4 r?~2 

I / fftj-fft ) 

V ''o J o f 

xf fa ** r>rt ' ***» > 

and 


G^CxJ 


■tfT 


*<**+■* 




b) The correlation surface, vs* , which is in ju-o 

distinct parts joined along the plane x - x - O , is given by 
(4.2a) 

£«ri-‘-% n .ZfX x -*„ 

fr* V4v'--***V„ , 

/ */* r t/* *“ r e, r> t -*s 


rM£% 


*!/• ••* *4 »,-/ ) 
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if/, *//r /Jg * **,*#+* *'" "* 

X 4-/ 


^ ^ ^/ * **° ) i 


(42b) 


r*br*/~*// f'V/'t/ *t>n t -2 r^V^r" t*u 


- / , r^' -t>r*r*r 

r ^HJ c •/ l 


( 


O VO 


— -*>^*45,^/ 


j* *s tr ■-*/*- *1^/"-%^- 


X &fii/,t t/ , ••■ if, *'./ ) ff-Xg- £,,t '"^/r,jf ‘ t* J *',x* /j ” ’ £j ■5-''^ 

«%-/*** **— 
fa S *,<*>)• 


X ctt*„ ,--• Ctt. 




c) rte regression curves of <% on x, and <?/ -4 on 4 
ars linear and are ptt/en, respectively, by the following equations: 


(1.31) 

x 4 * n, + f ^ ) M , 

and 


(1.32) 

*, -•%£ + C»r *»)*. 

where 

M - J / fftjdi. 


Hence t the cofficient of correlation between 4 and ^ w* 

Proof, The proof for the marginal distributions of a?, and 
of ^ are given by Craig 10 and need not be repeated here. The 
expressions tor the correlation surface are derived by a simple 
extension of the same method to two independent variables. The 


A °AHcn T. Craig, loc, dt., p. 356. 
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limits of integration may be easily verified. 

As in the proof of Theorem I, the regression of x^ on x f 
is given by the locus of the ordinate of the centroid of the section 
of the surface for a given x t . However, as the surface here is 
in two distinct, but connected, parts, we have two terms in both 

numerator and denominator. The expression for is 

/ *, 

Xj f? Yx„ Xgjdxg +/ JCg Xg JdXg 

(4.3) Z 4 * & ' 

f V fa, x*)c(Xg +f K,fa,, Xg )dxg 

where and ^ are defined by (4.2a^ and 

(4.2b), respectively. 

In the paragraphs immediately following, we shall be con- 
cerned principally with interchanging the order of integration, 
with the accompanying changes in the limits. It will be convenient 
to write the differential immediately following its respective 
integral sign. Consider the first term of the numerator. Sue- 
cessive interchanging the order of integration between inte- 
gration with respect to and with respect to 4/ * 4* » * ■ • 
respectively, and making tjie appropriate changes in the limits, we 
get, writing S for f Xg, ••• ^ J 




/A .. f*r T * S*I 

( • )Jdt n J di'g -J ^"'tzA , <2 ^ 


o o ~o 

r*r**r' m “ 4 . 


/ *r T n r *!-*!/ 

^v '7 


'u 


* 5 v' 


l 


k ) +r"~ t 4.n,-a 


X **#*,*Xg- 

Now consider the second term of the numerator of (4.3). As 
he limits are constants with respect to the variables of integration 
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X z , t i( * we ma y interchange the order of integration 

successively until we have 
(4.5) 

/ */ r°° 

a l ****“$ dt,k '*b t d * z 

p'*" '* ”>-■* ‘ 


v O ”0 


dt *. *&** 


r* " r *«u f 

l M **H 

f**-*!/ *'*,/ % k /Jt *f ~"~*4 V* 

J ***& ** *** • 

We may now combine the first and second terms, (4.4) and (4.5), 
getting 

/ *■' f */-*// s*> 

<**»/ <***■■] d *'*,»£ + . dx * 

•4 Jo * Jt.,* *,*+—+*,*. 

. * -i- mJrn - - 

/ 


/ 




^ 



*U-/ 


^*4? / 



d ***'S*'J 0 


*ftir ■ 

V 7 " 



*.*,*+* 


As the limits of integration are constant with respect to the 
variables ^ and we may at once interchange 

successively the orders of integration with respect to ^ and with 

respect to ^ ^ ^ +j, * ^ n ^/ , respectively, making 

the proper changes in the limits. We then have 
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The denominator of (4.3) may be reduced to this same form 
except for the absence of the factor in the integrand. 

Let us make the transformation ' 


V 


Xj - tf, »f-/ r 

as was done in the proof of Theorem I. The limits 
to oo on x z now become O to oo on V . We have now re- 
duced (4.3) to the following form: 

(4.7) 

W '/*/”" 

^y." y V’X-‘ 



The denominator reduces at once to G(^x t )\xy (4.11). As in the 
proof of Theorem I directly following equation (1.6), it will be 
noted that the terms of the numerator fall into two groups : those 

terms containing the factor ^ and 

the terms containing the factor v o r > /j* 

As the limits of integration with respect to each of these letter 
variables are 0 and oa , and since complete interchangeability of 
the order of integration is then permissible, it is readily seen that 
any two of these terms are equivalent. The sum of the 

entire group, then, may be written 
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In (4.8), it is clear that the integrations with respect to each 
may be effected immediately by making use of the hypothesis 

/ Q9 

. This leaves K ffvj remaining as 

the integrand. The / f(\sjdis is a constant which we shall 
Jo 

designate by M . Removing this constant from under the integral 
signs leaves us merely tne expression for the marginal distribu- 
tion of times We then have 

/X . jZ* a n fftfjfv 


^4 <&> 


^• 9 > 7* — 7-^ 

That each term in the summation in the right member of 
(4,9) is equal to any other term in the summation, follows from 
the complete interchangeahility of the order of integration of any 
two consecutive variables, provided a corresponding interchange 
between these two variables is likewise carried out in the limits of 
integration. By successive interchanges of variables we may put 

the original 4/ » t* * m an y order we choose. Hence, 

the sum of the last k i9 terms of (4.9) may be written as k fZ 
times any one of them. For definiteness, select the one containing 
the factor *>/ in the integrand of the numerator. We may now 
integrate out all of the zf , f j = and the i/ exactly 

as before. Equation (4*.9) then becomes 






4/ 


JX/ 




‘•i* r,- J 




It is not difficult to show that I™ 




x, 

ttt 


Hence, we have 


In exactly the same manner, we may show that 
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The coefficient of correlation between X t and X z is 

Kz 

(r?, rtgj® * 

which completes the proof of the theorem. 

Illustration. Consider the two sums, , and 

- t N v- , with 4/ ; 4* > 4 a* > as random drawings 

of t from the distribution characterized by the function fCi)*e* 
for t on the range O to co . From (4.1i), the marginal distri- 
bution of X>{ is 

& f fx, Xj e~ 

Similarly, the marginal distribution of x z is 

Cs fxjJ * Xj * '**. 

The correlation surface, obtained by applying (4.2a) and (4.2b), 
is 

fifa. XjJ* ff e '**J, Co S *2 i ); 

and 

/jCx/.-XjJ* e *C/-e 0, Cx,f *2 < *>)■ 

5, The correlation among more than two sums. We shall 
state, without proof, the following theorems. 

Theorem VI. Given a probability function % f(i), of the 
second kind, and three principal variables , x f > , defined 

as for t Theorem //, Then the correlation surface w.* Ffa ft x 3 ) 
is given by 


(5.1a) 
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fXy*" % 

Jo 

£ x, < oojj 


and 

(S.lb) 




/ 

/ 

/ 


W/-W 






**■*.*43-9+''" ‘ 




” ly, -«!. 






*3 


•V* 

dt 


/' ' r< *■» ’*> 0, *„-? v /-^ 
e *. t , , , *•; 

Theorem VII n, . fa****"). 

n vi j. I he regression curves of „ 

\ ° n * °f tk * relation surface in Tk ' " * ^ ° f 

md are 9iven, respectively by the toll • VI ° r * N * ear 

(2.21) \ b y th * following equations: 

4« 

o«af ' -* ~^5 “ ' 


( 2 . 22 ) 
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where M is defined as in Theorem V . Further , tfu? coefficient 
of correlation between x, and x $ is 

^ ^ r 


( 2 . 3 ) %*/ r *>~* 

Theorem VIII. The statement of this theorem differs from 
that of Theorem IV only in that fft) is now to be a probability 
function of the second kind. 

1 III. Sums of elements drawn from a universe characterized 
by a probability function of the third kind. 

6. The correlation between two sums. We shall now con- 
sider principal variables defined as the sums of values of t drawn 
from a universe characterized by a probability function of 

the third kind, defined on the range O to a , and with 

JWtJctt « /. 

The correlation surfaces are not developed with the same degree 
of generality as were those in the preceding pages because of the 
tediousness of the labor involved and the complexity of the corr 
relation surface, which may consist of many sections joined to- 
gether. Thus, if x is the sum of m values of / and y the sum 
of r ? , all drawn from a universe characterized by a probability 
function of the third kind, the correlation surface, iv* r&y), 
consists of A?/6?>7^sections, each having its own equation. Hence, 
only the^case where x and y'each consist of the sum of two 
values of t , with one of these held in common, will he considered 
here. 


Theorem IX. Let a probability function of the third 
kind, characterize the distribution of a variable £ . Let the prin- 
cipal variables x and y be defined by the relations + t fJS > 

y- t„ + , where s * we independent random 

drawings of i from the universe . 

a.) The marginal distributions of x and of y arc given by 
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(6.11) 

f * 

&, fe) = j frtjffx-Odt, 

Jo 

fO ^ x £ a) ; 


f<2 

«/ fft) ffx-t) di, 
Jx~a 

(a '5 ac § 2a); 

and 

(6.12) 

Gify) fft) ffy-t)dt. 

(OZy^ a); 


-f fft)ffy-t)dt, 
J y-a 

fa § y §= 2a). 


b) The correlation surface, w * P'fyy) , is given by 


( 6 . 2 ) 


Ffoy) - frft) ffx- 1) ffy - t)dt, fo SySAs Sa) ; 

* / fft)ff x-t) f (yd) dt, (O § x sysaj; 

Jo 

f* 

* / f(V ffx-t) f f y- i)dt, ( asy Sx+as2a); 
* y -< » 


' f fft) ffx- t) f (y- 1) ait, ( 0^x-a%y & a) ; 
Jx-a 

'[ f(t)f{x-t)f(y-t)dt, ( aSy SxS 2a); 

Jx-a 

‘ j C*-t) f(y-t)dtr, f a%x sys 2a). 


In a) and b) above, the subscripts have been omitted from the i ff 
c) The regression curves of y on x and of x on y are 
linear and are given , respectively , by the following equations; 

( 6 . 31 ) y* % + 

(6.32) and 

where M* f fftjdi. 

Jo 

Hence , the coefficient of correlation between # and y is ^ , 
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This theorem is a direct generalization of Rietz’s paper in 
Biometrika cited in the introduction to this paper. The proof may 
be supplied by the reader. 

Illustration. Let us consider the rectangular distribution 
given by f(t)* 5 , for / on the range O to a , and a to O. 
This is the parent distribution in Rietz's case when a - / . From 
( 6 . 11 ), the marginal distribution of x is 

&, Cx) * -£*> (O I # i a); 

fia-x) 


a* 


(0% x 

(a £ x 


2a). 


Similarly, the marginal distribution of y is 


G^fy) = ~s> 

- 

a* ' 

The application of ( 6 . 2 ) yields 

y 




' TP* 
fx-yt-a) 

* 5 , 

a 3 

(y~x + a) 

’ 

(2a-x) 

‘ a-> * 

( 2 a-yj 

’ ~~pr> 


(0% y% a); 

(a t y% 2a). 

(OiySxSa); 

( O'ixi.yt a)\ 

ft 

( aiyix+a$2a)] 
( Oix-aS y£ a); 

( a 5 y £ x% 2a)~, 

( yi 2a). 


These results, obtained directly by the use of Theorem JX, agree 
with those obtained by Rietz in the above-mentioned paper, 

C<»JL H . . 



ON THE CORRELATION BETWEEN CERTAIN 
AVERAGES FROM SMALL SAMPLES* 

By 

Ali,En T, Craig 

1. Introduction. It is well known that no correlation exists 
between the arithmetic mean and standard deviation of samples 
drawn at random from a normal universe. However, there seems 
to be in the literature no treatment of the correlation between 
other averages either for normal or non-normal universes. In the 
present paper, a few simple theorems are established which make 
possible the determination of the type of regression of the median 
on the arithmetic mean, of the range on the median, and of the 
range on the arithmetic mean. In case the regression is linear, 
the coefficient of correlation may be computed. 

We shall understand a probability function f(x) of a real 
variable x to be, for all values of x on a range of P a single- 
valued, non-negative, continuous function with 

Then / f(x)dx is the probability that a value of x chosen 
J a 

at random lies in the interval (a, b) where a and b are in R 
and a< b \ and f(x) dx is, to within infinitesimals of higher 
order, the probability that a value of x chosen at random lies 
in the interval fa, x-t-dx). It will prove convenient to classify 
probability functions according as R is the range /‘os, oo), f 0, an), 
or CO, k), k>0 . In accord with this classification, 1 we shall 
refer to probability functions as of the first, second, and third 
kinds respectively. In a similar manner, we define a probability 
function F fa,y) of two independent variables. 

♦Presented to the American Mathematical Society, Dec, 28, 1931. 

J Cf. L. Bacheher, Caleul des Probability, p. 155. 
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2 . The correlation between the arithmetic mean % and the 
range W. 

Theorem L Let ffic) he the probability function of the vari- 
able x . Let tffaWjbe that of the arithmetic mean % and the 
range W in samples of three independent values of x . If ffy) 
a probability function of the first kind, then 

Fa, wj-iaf 3 ffx,)ffx r wjftex-ex'+wjoix 

Proof. Let x i , , x 3 , be the three observed values of # , 

Write 

Xf + + )Cj * 3%, 

* W, 


For 2 assigned, , and V/ assigned, O £ V/<oc wc 

must have 


2 v- 


^ _ 

-n- < X< X ' + 


2W 


^ * 3k - K f - & 3 . 


If we consider all possible arrangements of x n ^ , we have 
F Yk W<3k dW= 6 f ffy )f(xJ fYxJdx, ofx <Ax . . 

Lf * 

Let 

x i = x i 

Xj * 3% ~Xf“ Xj , 

-V * f -W. 


The absolute value of the Jacobin is 3. Hence the theorem. 
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In the case of samples of four independent items x n 
the probability function fij is given by 


„ iv 

/?/% WJ* 48/ / ffx/ffrj )ff4Sc-£x, -X, * Wjffr r W)d^ </x, 


4 * 


x, + <22T 4x-3x,-HW 

<-48/ / ffx^f/x^) ff4% -2x t - Xj + Wj f fx f - Wjo'x^ e/x f . 

, w J Tf , 


~*+f 


We note that the probability function is made up of the sum of 
two parts depending on whether x f is in the interval ( x + , 

x+g)or in the interval /%+^r Moreover, it may be of 

interest to note the overlapping of the ranges of integration of 
* To prove that rffaW ) is given as stated, we take 

V- Xg V- v- * 4 X , 

0) * 3 > *2* *1> 


From (l) it readily follows that 

(2) 8x 1 + Xg + - 4% + W. 

For assigned values of x and TV, the upper limit on x ? is found 
from (2) by taking = a^ = x t - W , Thus x 1 *x+ ~T~ * 

Similarly, the^ lower limit on x f is found from (2) by taking 
* *3 * *t • Thus x f « x + . But may not always 

be as large as x f for all values of ^ . This may be seen by taking 
^ * *7 and in (2). This leads to ^ ^ /■ . 

Thus, for * - x t £ x + , we see that at, is the upper limit 

on Xg . To determine the lower limit on x £ for this region of 
variation of , we select x t z as near - x i -IV as is possible 
without causing x 3 to exceed x f . But x 3 * 4Z~£x f - ^ + TV . 
At most, then 4x-s?x f -Xj + W~ x f or 4jc- 3x 1 + V* 
Thus we have established the limits of integration used in the 
first part of the sum of which Wj consists* A similar argu- 
ment shows if x /- J Y 5 x 1 5 2 v- ~jf' , that 


X,- VS s Xj s 42 -JXj + 2V 
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If fix) is a probability function of the second kind, we 
observe in samples of three independent items x ft x z t f for 2 
assigned, that 0 < WsJx . If O* TV's 3*/ 2, we have 

ar 32 " 2X f + 112 

- W) 


and if 


< VS £ 32 , we have 

Ws >!,Si + 


Accordingly, 


* J j? ~ v* 11/ 

= x f - WT 




Fife, Wj^iaj f(x,)fC*.-W)fteZ-2x, * V/)dx, , OsWa 

J >c+Y 




- 18 Efx, ) ffx, - Wj ffaz-lx, + Wjdx 1 . 'fa Ws 32: 

J W 

In samples of four independent items x f , ^ ^ , x 4 , drawn 
from a universe characterized by a law of probability of this kind, 
we find 

r*+Wf x i 

8} fa, Wj* 48 1 yy f ffzjffa^)f84x -2x t 

2 x y> j^r v ** of /■ 

*je - jx 1 +i w 

+43/ / ffx,)f fz )ff 42 -2% -x + W)ffc~ Wjdx , 


'*+¥r*f 


OiWi 


r x +T 

*43/ / ttyfWfr4X^-*.+vrJfr* r irM* t ' a*, 

W ‘ J 4%-3X f + W h 
.2+'W r 4x--t*,*-SU' 

+4*1 w f fC* 1 )frxJff42-*x i -x,+\Vjf('x r W’J ctx,c/x f , 
lje*¥ Jx.-W * ' * ’ * * 


f * + '^y 49 - Jx, +2W 

=4 ®/ J W* f ) ?f*i 1 ff 4x -£x f -x £ +Wj ffy - Wjctx* dx t . 
22 s US< 4.x , 




V 
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Finally, consider f/k) to be a probability function of the 
third kind. In samples of three independent items » 

for O < x < k/3 , we obtain O <\ Vs3x ; iovk/3s xs2k/3, 
we obtain O s M/W /r ; for 2k/3< 2 s k , we obtain 
Os TVs Of It is fairly easy to see that for x and IV 
assigned as indicated, the following regions of selection of x f are 
valid : 

for O s X s k/2 and Os IV < 3x /2 t 

or for k/2 < 2 s k and O <W< 3kk- Sc ) / 2 f then 

vc+W/3s x f s x + 21 V/J; 

for 0 s 5c < k/ 3 and 3 x/ 2 's W s 32, 

or for kfj < 2 * k/2 and 3 x/2 s W s 3kk~xj/2 } then 

W* x f * x +2W/3; 

for 2k/ 3 £ 2 & k and 3k k-5cjf 2 < XV * 3kk~xJ 
or for k/2 $ x s 2k/ 3 and 3 k k- 2 )/ <? * 1 Vs 35c/2 , then 
x + IV/ 3 s x f < k ; 

for k/j < 2 < k/2 and 3k k-xj/2 < TVs k, 

or for k/2 s 2 < 2k/ 3 and 32/ 2 s Ws k , then W r sx f < k. 

Thus, 

- jew 
r*+ 3 

r r k*,Wj~ IS / ffo J fkx r Wj fk3x-2xk W/3x r , 

-Id/ fk*,)fkx f - WjfkSx -2x f + W)dx f , 

J w 

*18/ f/x r ) ffo f - WjfkJx-JXi + lVjcte, t 
J *+¥ 
r>r 

* Id I f - Wj ff32 - jgju, i- Wj4x r> 

over those regions of the xW -plane indicated above. 

In case of samples of four independent items x f 9 , x 3 9 

( drawn from a universe characterized by a orohnKUu*. r • 

^i-- — ■* « * * - 



f BBn ' Em cmT '"» WRAGBS 

- * for * . * 
of «— ky iv::zt ,o "°"' s ^ 

- o 

(A)jU/= 4% 

W=&j£ijcl 

IW J 
(fyjVmJJB 

lw=# 

W- 2 Z 
(C) j w= 4/^? 



V- ^ 

( D ) V- 2 f h-i) 
W= ~CAt £J 


Further, let 


[V/ - *f- 
(U) V/ = 2fk-ic) 
IW - 4-Ck.Jt) 

t*-¥ 

(F)UV 
\W 

w - 2 r*- 2 j 

(V » k 

( a )\w~ 

3 ~ 

( u )jw - ^ 

IW --= 


and let 


fl *’ Ur VKrWm^ tW1 




a - c 

11 “ """ « ««"* ,o verify ,h M 


^ a! 
a C J 


ft '(kwj. 4 m. 


ll w 


x *+? x,-w 

IV 


4 *- 3 x, +£W 


>& , (A) 


z *, \ /f.JW J 


*46\ 


4 *- 3 x,+w 
W 


x,-W 


: 4S [( 


x- f -W 


4 




(C) 




x r W 


\ CD) 



'48 


*48 
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l 1 

1TX 

4^* 

x> ) 
4x-3x f + V/J 

0 , 

(E) 

i — i 

T*i' 

*' )' 
4x-Sx 1 + W) . 

]' • 

(F) 

i t 

4x-Sx, +<2W\ 
x r W / 

K 

(G) 

$ 

*' ) 
4x-Jz f + Wj _ 

]'• 

(H) 


* 48 


As illustrations of these theorems, let us find the correlation 
between the range and the mean for universes of specified types. 

O i t K < CO . 


Example 1. Let 

For samples of three items, we have 
. FjX'W)* 6 We 3Z , 


OsWs X' 


The distributions of the marginal totals of W and x are obtained 
by integrating /f/% W^with regard to x and W respectively. We 
readily find 


<pfx) 


27 x -3* 


OX X> < 04 , 


and 


))f(Wj*2e 2 OxW<oo f 


as previously given by jhe writer. 8 For oc assigned, the mean 
of the array of W is W% « 4^ . Thus the regression of 

- . .. - ./73r * 


on £ is linear and r * 

o 


American Journal of Mathematics, VoL 54 (1932), pp. 359, 366. 
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Example 2. Let, fM s l/k, O - x < k. 

For samples of three items, we have 




6W 

~P ' 


id 

• 7 F 







over those regions of the 55 IF- plane indicated above. The mar- 
ginal totals 8 are distributed in accord with 




27 x* 

2k* ’ 




9 

'IP 


~ &%*+ 6k%-k 


']• 


±2 6 



and ^(W)* urj' 0 < Ws * _ 

We readily find 

Wjg * 0 3 2 i j > 


a Cf, H. L. Rietz.On a Certain Law of Probability of Laplace, Proc, 
Int. Math. Congress, Toronto (1924), pp. 79S-799. 

J. O. Irwin, On the Frequency Distributions of Meaps, etc., Bio- 
metnka, Vol 19 (1927), pp. 225-239, 

P, Hall, The Distribution of Means for Samples of Size N, Bio- 
metrika, Vo!. 19 (1927), pp. 240-245. 

J. Neyman and E. S. Pearson, On the Use and Distribution of Certain 
Test Criteria, Biometrika, Vol. 20 (1928), p. 210. 
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m £k s -27k*Z+27kx* k < < 2k 
6k * - 36 kx + 36 x* ’ 3 ~ " 3 * 


Thus the regression curve of W on 2 is continuous, but the 

A - 2 k 

regression is non-linear for -j $ x < -j- 

3. The correlation between the arithmetic mean x and the 
median £ . 

Theorem II. Let ffx) be the probability function of the 
variable x . Let ?)be that of the arithmetic mean x and 
the median £ in samples of three independent values of x • If 
ffx) is a probability function of the first kind , then 


(*, V~18 ff f f )f ff*,) ff 3 k - C- * f )dx i , f < k, 


'S2-AZ 


-ldffOf ffxjffsz- 

Proof. Let x 1 , , x s , be the three observed values of x . 

Write 


+ X Z +*3 * 3x, 

X 2* ? 

X 3* X * ~ * 1 

For x and % assigned, f < x , we must have 
3x - 3 ff < X f < oa 

K s =Jx- C-x t , 


f & X f < GO 

X 3 ~ 3x - C~ . 


and for x < 
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If we consider all possible arrangements have 

-00 

%faf)c/xd$ * ffx t ) ffr 3 ) dx, f £ E, 

«6f/ r ?)ct?J°frx 1 ) ffc 3 )dx, dXj, x< t 


The change of variable x s ~ Jx-f-x, establishes the theorem. 

In case of samples of five independent items x 1 , x z , x ^ , 
% f the probability function /f/x, fJ is given by 



This follows immediately from the fact that for x and f as- 
signed, £ < x , we may have either 

4>£ 

Sx ~3f x f < < t», 

•%.« Sx-f 'X,-**-**. 

or 

dJi - 4-f £ X f -coo t 

fs % <<», 

Sx -Xj < %* i? 

* 

X S *5Z -f'Xr-Xz-%, 
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and for x < % , we must have 

£< JCj <oo 

%<%< 00, 

-XgS x 4 * <f, 

•*Jr « Sx.-f- x,-Xg- ■%■ 

If -f(k) is a probability function of the second kind, it is clear 
that 0< <f £ in samples of three items. Then 







Oi?< 2, 


! 


Xifs 


32 
2 ’ 


In case of samples of five independent items drawn at ran- 
dom from a universe characterized by a probability function of 
the second kind, can best be expressed in a form employ- 

ing the notation used previously. Thus we write 
$* JffJZ-f- x t -x£- x+J, 


and 


Then 


= Jx-if - x,-Xg 




t>*d~ * 


fjf 

ct c e 


& at a'x 1 


t> Gt f 


ace 




i\r 


a 


30 


*30 


U 3J 


U 31 


U 11 


$+ 


11 






U J1 

“31 


U 


12 

a 


*12 


* 


o * it* 




/ 
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Finally, consider f{2) to be a probability function of the 
third kind. In samples of three independent items, for Os 2<k/j f 
we obtain 0< / * 32/2 ; for /c/3 s 2 * 2 k/ 3 , we obtain 
/32-k)/2s/< 3z/2 \ for 2 k/ 3 s 2 « k , we obtain (32-k)/2s/s k 
It is not difficult to verify for x and f assigned as indicated, 
the following regions of selection of x f are valid: 
for 0< 2 s k/3 and Os / < 2 f 
or for k/J s 2 s k/2 and 32- k < / < 2 , then 

32-2/*x,*32-f} 

for k/3s x < k/2 and /32- k//2 s / < 32 - k , 
or for k/2 s 2 s k and /32-kj/ 2 s / s 2 p then 
32-2/ s X f $ k ; 

for O s 2 s k/2 and 2 s / s 32/2, 

or for k/2 < 2 < 2k/ 3 and 3 2- k s / < 32/ 2 , then 

? * * 3* - ? ; 

for k/2 s 2 s 2k/ 3 and 2 s / < 32- k, 

or for s 2kj 3 «? 2 * k and 2 * / < k , then £ < x 1 < k 


Thus 


, 32 -/ 


tetftjf ffr t )ffJ 2 - r~*r 

32-2/ 

* 16 fCCJ f fk f J /CJ£- r- x,J e/x , , 


j*-sjr 
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-ldffej/ 

T 

*16 ff?)f ffyJffJx-f- Jdx 1 , 

f 

over those regions of the x % -plane as indicated above. 

With samples of five items, the correlation surface is defined 
in so many parts that we shall not take the space necessary to 
consider it. 

As illustrations of these theorems, we shall find the correla- 
tion between the median and the mean for universes of specified 
types. 

Example 1, Let ffx)* e Os x < po . 

For samples of three items, we have 

^ r*. r) - is? e - *f 2, 

- ISfJX-JfJe ' J f 2* fs~ 

The distribution function of the marginal totals of % is given by 4 
Oe Os <? < oo . 

For x assigned, the mean of the array of ^ is 

* 6 

Thus the regression of ^ on is linear and 
Example 2. Let ffx)* ~~ , O < xz s ft m 
For samples of three items, we have 



4 Cf. American Journal of Mathematics, Vol. 54 (1932), p. 364. 
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IS 

k 


3 rsz-Jtr), 




over those regions of the 5 £ -plane indicated above. The distri- 
bution function of the marginal totals of is given by 6 


We find 


<5 ' 


<?s 2* j » 


6% s -£( r 3x -k) 3 


k < o < 
T***j~' 

% H. g -g /f. 


Thus the regression curve of ? on x is continuous but the re- 

Lr J>L* 

gression is non-linear for j * z < -y%. 

4, The correlation between the median f and the range W. 
Theorem III, Let ffxj be the probability function of the 
variable z . Let that of the median ? and the range 

IV in samples of independent values of X . If ffe) is a 

probability function of the first kind, then 

/y /J / r ,r T ^ 


u r,r-uy t J ^ 

Proof. We have 




' m+l 


r r x < - 

W'j 

,r 



/ fftM* 

V{ J 


J 


vk 

h 


X. f ~ V/ £ X 


m+1 * 






fi Cf, P. R. Rider, On the Distribution of the Ratio of Mean to Stand- 
ard Deviation, etc., Biometrika, Vol. 21 (1929), pp. 136-137. 
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Hence the theorem. 

If ffx) is a probability function of the second kind, then 

ft2mV)/ r r* 1 


[feteA \f titbit 


^ m-J 
cte f} Ws£ 

m-J 

<Ux t . 1ft sW. 


Finally, ' consider fft») to be a probability function of the 
third kind. We observe for 0 s ft s k , that O s TVs k . For 
assigned values of ft andW , the following regions of selection 
of k 1 are obvious : 

for O s ft £ k/2 , and Os TVs ft, 

or for k/2 < ft < k and O < TVs k- ft then ft s x, s ft+W; 

for Os ft s kj 2 and ft < W< k- ft , then TVs s ft* IV; 

for O s ft s kj 2 , and k- ft s TV s k , 

or for kj2 < fts k and ft s IV s k , then TV < x f s k ; 

for k/2 < ft < k and k-ft < TVs ft , then fts x, s k . 

If we write m .i „ m-J 


tvcxjfr^wj 


/*' 1 

r 

r i i 

/ frt)dt 

[ / fftijdt 

J 

:*r w 


we have 


x , ft2 m+1)! 


fl? 777+1 j! 




rf+V 

ffft) / ft Vx 1 , 

r 


7/ 

7F fff) JL *«*’■ 


fl? 77?+l)! 

J \fw-JjFf 

777+1)/ 

[ft rn-jjftf 
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over those regions of the flV- plane previously indicated. 
We shall consider two simple examples. 

Example 1. Let f(x)*e * Oz X <oo. 

With samples of three items, 

ff, Wj*3e Ws £ 

-Je ?<W. 


The regression is readily shown to be non-linear. 

Example 2. Let ffc) - jr , 0* x* k. 

With samples of three items, 




6 W 
k J ' 

6l_ 


'jpOt-W) 

* k-f. ), 


over those regions of the ^W^-plane which have been previously 
given. The mean of the array of W corresponding to an assigned 
f is ^ ~ . Accordingly, there is no correlation between the 
median and the range in samples of three items drawn from this 
universe. 

It is easy to employ the type of argument used in establishing 
Theorem III to obtain the probability function of the median and 
lower quartile. Thus, if f(x) is a probability function of the 
second kind and the probability function of the median 

f and the lower quarttle rf in samples of dfnt-l items, then 




r 

fftjett 


m-J 


r>af. 



ON THE DEGREE OF APPROXIMATION OF 
CERTAIN QUADRATURE FORMULAS 

By 

A. L. O’Toole 
National Research Bellow. 

If f(x) be a continuous function of period 2rr> and if the 
interval under consideration, say the interval from O to 2 77 , 
divided into m equal parts by the m+1 points x i =2irr/m > /«£? 
2,. , . . } n?j then the trigonometric sum of the border coinciding 
in value with f(x) at the >-n + l points , or the trigo- 
nometric sum of the nth order lacking the term in sin r?x , is, 
according as m «■ 2 n + 1 or 

fcj*~a 0 +a, cos x + cos ta n cos nx 

y t>. sir? x y 4 sin 2x+ • • • y b sin nx 

/ xf r? 


or 

u n fc)=j; a 0 + a , cos x, + a £ cos 2x v- +£ a n cos nx 

*4 s/nx y 6^ sin 2x y • • • y sinfn-l )x, 

where 


C. rr? 

a r* = 27 ffci ) cos kx * , /?* 

K 77 £mj 1 * m * 

/? m 

* n •stn kx; . 


If the Fourier coefficients of ffa) be denoted by 


1 f** 

- — J* ffajcos kx: aSx , 

C? 

/" ffaj sin kx dx. 


s sr 
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then it has been shown 1 that the interpolating coefficient^ a k and 
b k are approximations to the Fourier coefficients oc k and /3 k in 
the sense of the rectangle quadrature formula, in the sense of the 
trapezoid quadrature formula, in the sense of the average of the 
results of two applications of Simpson’s formula, and in the sense 
of higher quadrature formulas. In other words, the simple rec- 
tangle formulas a k and are as good approximations to the 
areas and (3 k as the estimates given by the trapezoid rule, 
the average of two applications of Simpson's rule, or higher quad- 
rature formulas. 

It is the purpose of this note to discuss certain quadrature 
formulas and to observe some other conditions under which the 
rectangle formula will give as good an approximation as the more 
complicated formulas. 

The most elementary and best known of the formulas are the 
rectangle formula, the trapezoid formula, and Simpson’s formula. 
Many of the more complex rules are the results of attempts by 
different investigators 2 to improve by various devices the approx- 
imations given by these three simple rules. 

Suppose the area under consideration is bounded by the curve 
y* ffe) , the ^ -axis and the ordinates at x»a and x* b , If the 
interval from a to b be devided into tv equal 8 parts, say of 
length h , by the r?+l points a, anc * 

if rectangles, each of width h and height * * * v n-1, 

be constructed, then the area as approximated by these n rec- 
tangles is 

( 1 ) 


l D. Jackson, Some Notes on Trigonometric Interpolation, Amer. Math. 
Monthly, vol xxxiii, no, 8, October 1927. 

2 See Runge and Willers, Encyklopadie Der Mathematischen Wissen- 
schaften, Bd. 11:3 (1915), pp. 45-176. 

^Discussion from point of view of least squares, Otto Biermann, 
Monatshefte Fur Mathematik Und Fhysik, 14 (1903), pp. 226-242. 

For unequal intervals see Jas, W. Glover, International Mathematical 
Congress, Toronto, 1924. 
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To find an expression for the error we assume the first deriv- 
ative exists, so that for the first rectangle 

ffx) « ffa) + Yx-aJ f Yu), 


/ 


a * 

ffx)dx - hffaj* Q- f YzJ, a<z< a+/i. 


Hence the error for the n rectangles is 

.<? 


<i«) 


- h ’z f'HJ - /W 


V*f 


2n 


f'fej, a<z<6. 


i.e., an error of the order of ~~ . 

Let r)*mk, k*l,2,3 t . If we approximate the area in 
the first k subintervals by a parabola of degree k coinciding in 
value with f6cj at the first k values of ^ , then integrating 
Lagrange's interpolation formula an expression for the error is 
obtained. If k is odd then 





fn+J)' 


, H- kh, 


where 


c, i*Yk*t*- 3*)- • • (kWk-efJdt. 

If k is even, then making use of Rolle’s Theorem, 


// k+3 / 


£ Jt “ 'V 1 2 


where 

Cj -pb 

The error over the whole interval will be obtained by summing 
the -m errors corresponding to each k subintervals. 
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If 77 trapezoids are formed by joining the ends of successive 
ordinates then the area as approximated by th$ sum of the areas 
of these trapezoids is 

<2) 

and the error is 
(2e) 

i.e., an error of the order of . 

Simpson’s formula may be obtained by passing second de- 
gree parabolas through the ends of three successive ordinates, 
that is k* 2 , and gives 


f rh), 

12r? z 


(3) 


A 




77 ) 




L y*o 


77**?Tr?. 


The error is 


(3e) 








i60r> + 

i.e„ an error of the order of ^ . 

To illustrate the fact that sometimes the rectangle formula 
(1) gives a better approximation than the Simpson formula (3) 
these formulas will be applied to the problem of finding the area 
Under the so-called normal curve of error. From a table 4 giving 
five places of decimals it is seen that the ordinates to the right 
of a? * 4, 76 and to the left of x = -4 76 are everywhere zero 


,if the equation be written in the form y 


i -f 
i w e 


Divide 


the interval from x = - 460 to x*4.8Q into eight partial in- 
tervals each of length 1.20. Formula (1) gives A^9999& while 

4 Ja$. W. Glover, Tables of Applied Mathematics in Finance, Insurance 
'and Statistics. 
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(3) gives A=. 97834, the same ordinates being used in each 
case. 

There are three objections to the nature of Simpson’s form- 
ula. They are the lack of smoothness at the points of intersection 
of the parabolas, the unequal weights attached to the odd and 
even numbered ordinates, and the requirement that the number 
of ordinates be odd. 

Catalan® notices the lack of smoothness at the intersections 
of the parabolas used in setting up Simpson's rule and improves 
on it by passing parabolas through three successive ordinates and 
then retaining only the first half of each parabola except in the 
case of the last three ordinates where it is necessary to retain the 
whole parabola. To counterbalance the asymmetry introduced by 
these last three ordinates he repeats the process beginning with the 
last ordinat and then takes the arithmetic mean of the two results 
as his formula. 

This gives 


(4) Am/? 




And, of course, the error is still of the order of . This 

formula has the additional advantage that it holds no matter 
whether n is even or odd. 

Similarly Crotti 4 * 6 showed that the different weights attached 
to the odd and even numbered ordinates in Simpson’s formula ic 
a disadvantage. And Parmentier 7 by subtracting Simpson’s form- 
ula from twice Catalan’s obtained a formula in which the weights 
are the reverse of those in Simpson’s. Mansion 8 gave an alterna- 
tive derivation of Catalan's formula, his derivation requiring, 
however, an even number of ordinates. 


6 E. Catalan, Nouvelles Annales, 1'* series (1851), pp. 412-415. 
fl Crotti, IJ Politechnio 33 (1885), pp. 193-207. 

7 Parmentier, Association frangaise pour favancement des sciences, Ses- 


sion Grenoble, 1882. 

8 Mansion, Supplement zu Mathesis 1 (1881). 
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Catalan’s formula may be thought of as the 'rectangle formula 
plus three correctional terms* involving the first three and the last 
three ordinates. In the case of an even number of ordinates a 
formula® involving only two such correctional terms and giving 
an approximation of the order of the error in the single trapezoid, 
i.e., of the order of , the error in a single trapezoid of width 

. (b-a) 


being 




, can be obtained by applying 


Simpson’s formula to the first ordinates and approximating 

the remaining area by the trepezoid rule. Repeat the process from 
the opposite end and take the arithmetic mean of the two results 
as the quadrature formula. This gives 


(5) a*h 


^ i2 iify+y**) 


77* 


It is the only formula with just two correctional terms which will 
give even this order of approximation in general because any 
change in the coefficients of these end ordinates will introduce in 
general an error of the order of the error in the rectangle formula 
for a single subinterval, i.e., an error of the order of jyt . 

Another important quadrature formula is called the three- 
eighths rule and is obtained by passing third order parabolas 
through four successive ordinates. It may be written 


( 6 ) * 


3b 
' 6 


m 

ZX Ji 

VmO 


<3V 


m-J. 

+ 3Z y, 

V*0 d 




*1 3?7? t 


The 


error is 


(6e) 




i.e., an error of the same order as the error corresponding to 
Simpson's formula. The error terms derived from the Lagrange 

"Durand, Engineering News, Jan. 1894. J. Lipka, Graphical and Me- 
chanical Computation, Part. II, p. 226. 
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formula 4iows the advantage of using parabolas of even degree. 

Besides the fact that the order of the error is the same as 
that in the case of Simpson's formula, this three-eighths formula 
has disadvantages similar to those mentioned in the case of Simp- 
son’s formula. There is still a lack of smoothness at the intersec- 
tions of the parabolas ; the weights attached to the ordinates are 
as undesirable as before ; and the number of partial intervals must 
be a multiple of three. 

It is possible however to do away with these disadvantages 
by proceeding as follows. Pass a third order parabola through 
the first four ordinates y o , y f , , y s . Retain only the area 

in the first two partial intervals. Pass a third order parabola 
through the four ordinates y f , , y 3 , y* and retain only the 

area in the central interval. Proceed in this way retaining each 
time only the area in the central interval until the last four ordi- 
nates are reached where it will again be necessary to retain the 
area in two strips, viz., the last two partial intervals. The sum, 
of these areas gives the required quadrature formula. It is 



This formula holds for any n greater than or equal to three. 
From the point of view of the order of the error this formula 
is, as one would expect, no better than Catalan’s formula. As a 
matter of fact formula (7) can be obtained from formula (4) by 


subtracting from (4) _ 3 ) a quantity which, in ‘ 

general, is of the order of * 

If n«4m and fourth order parabolas are used in approx- 
imating the area in four successive partial intervals then the 
formula is 


( 8)/4 


4h 

~~45 


nj-J 

,+16 Z j jj, 


/77~J! y 



ISO APPROXIMATION OF QUADRATURE FORMULAS 
The error is, 

(«e) ’ 

i.e., an error of the order of j?6 . 

Several modifications may be made to improve this formula. 
For instance if r?=*£rn+l then apply the fourth degree parabola 
to the ordinates y 0 , y , y , y , y A and retain only the area 
in the first three strips. Apply a fourth degree parabola to the 
ordinates y , y 3 , y A , y s , y d and retain the area in the two 
central strips. And so on till in the final step it will be necessary 
to retain the area in the last three strips. Addition gives the 
formula 

( 9 ) A^[s96£y^ +644Zy^ t - 654&% m Umry t +% m J 

~4966y^ +y^ m£ )+106{y 3 + y? m _4. Jj 

A formula which holds for any r? may be obtained by passing 
a fourth degree parabola through y o , y i , y z , y 3 , y 4 and re- 
taining only the area between y c and . Pass a fourth degree 
parabola through y i , , y s , y s and retain only the area 

between and y s . And so on, retaining only the area in one 
strip, until at the end it will be necessary to retain the area in the 
last three strips. Repeat the process beginning at the last ordinate 
and take the arithmetic -mean. The result is 

(10) A * h [£o y * 'j § ) 

*740 Ay3 * yn - 3 ^'j6d ^ y ^£ 

This formula can be obtained in the case of an even number of 
ordinates by retaining three strips at the beginning, two from 
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then on, reversing the process and taking the arithmetic mean. 

Formulas (4), (5), (7) and (10) not only give, in general, 
at least as good approximations as* Simpson's formula, the trape- 
zoid formula, the three-eighths formula, and the fourth degree 
formula (8) respectively, but in addition have the important prop- 
erty that under certain conditions they show that the simple rec- 
tangle formula must ^ive at least as good an approximation as the* 
higher formulas. If ffx) is a function such that the curve 
y • / (x) actually, or at least for practical purposes, coincides 
with the x -axis to the left of x-d and to the right of x*b f 
then in dividing the interval from a to b into h equal parts 
each of length h it will not affect the area required if two, one, 
three or four partial intervals of length h are marked off to the 
left of a and to the right of b , the number of such partial inter- 
vals corresponding to (4), (5), (7) and (10) respectively. Hence 
it is seen that under these conditions (4), (5), (7) and (10) 
reduce to the simple rectangle formula (1). 

If the curve coincides with the x -axis at one end of the 
interval over which the area is required but does not at the other 
end then formulas (4), (5), (7) and (10) become respectively 

(4a) ® 1 ~ J 4 y^-* 

£m-l '-p j v 

(5 a ) A *h X/ \ err, y^>r7-z A 

(7a) A m h ( - y y n y ri _ 1 - 

(10a) A*h(E°y„ y„ y„./ ^7^a y n-3 ~l 60 y ^^ 

For example, consider again the formal curve of error and sup- 
pose that the area to the left of the ordinate at x*o is required, 
Formulas (4a), (5a), (7a) and (10a) apply and for sixteen par- 
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tial intervals give respectively A*. 49994 A*.49530,A*.50008 y 
and A*.5000£, an extra partial interval to the left of x=-480 
being used in the case of (Sa) in order to have an odd number, 
of intervals for that formula, Using thirty-two partial intervals 
the same formulas give 4*. 49909 y 4**49949 > -4-. 50000, and 
A*. 50000 respectively. 

If, as often happens, the values of ordinates outside the in- 
terval over which the area is required are known then even better 
quadrature formulas may be obtained. For example, suppose that 
in deriving formula (7) the ordinate y mJ at a distance of h to 
the left of y and the ordinate y^ f at a distance h to the right 
of are known. Then it will not be necessary to retain the 
areas in double strips at the beginning and end of the interval, 
and the formula for the area over the interval from x*a to 
x* b is 


( 11 ) Anh\ 


£* - 


J 


yko' v 24 ^ * y n* ”% ^24 


It should be noted that ih case y mt and y n + f are known Cata- 
lan's formula reduces to (11). And, similarly, in the case of the 
derivation of formula (10) it will be necessary to retain the area 
in a single strip each time except in the case of the last application 
of the fourth degree parabola when it will be necessary to retain 
the area in the two central strips. The formula arrived at is 


'240 & y * " 


Formulas (11) and (12) reduce to the rectangle formula (1) 
under the same conditions as in the cases of (4), (5), (7) and 
(10). Likewise when the curve coincides with the a; - axis to the 
left of xm a (11) and (12) become 
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(11a) A * h (£ o y V -^y^i "jfjfc + J 4 y n-l) > and 

(12a) A=hf -j ~0 y „+ , ■— y n ^y r! . 1 y n . 2 + 1440 y^J 


If we apply formula (11a) to finding the area under the normal 
curve to the left of the ordinate at and take r>*4 1 h- 1 . 20 , 
a«-4.d0 then we find A*. 4-9999- In other words, in this case 
(11a) gives as good a result with six ordinates as (4a) or (7a) 
give with thirty-three ordinates or (6a) with thirty-four ordinates. 

Quadrature formulas involving parabolas of degree hjgher 
than four have been obtained but they are to be used with caution 
on account of the great freedom they allow the approximating 
curves. However, modifications similar to those in this paper could 
also be made for these higher formulas. And the effect of. any 
number of ordinates outside the ends of the interval could be 
noted. 

This note will be concluded with a remark on the effect of 
errors in the data giving the values of the ordinates. Suppose the 

quadrature formula is A^hfa,, y a +a,y f £ a n y n ) 

and suppose further that each y { - is subject to an error e/, i- 0 , 1 , 
2,3,'-', n . If e is the greatest of the absolute values of the ,e,- 
then the error in A cannot be greater than he(a a +a. 1 +a 2 + -+a n ) 

if a 0 ,a t , a Jt ,a n are all positive, as wilt be true if parabolas 

of the fourth degree or lower are used. But 
if the area is to be four from x* a to b . Hence the error in 
A due to errors in the data is not greater than e f b-a), When 
parabolas of degree higher than four are used the coefficients in < 
the quadrature formula are not always positive. 
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POLYNOMIAL APPROXIMATION BY THE 
METHOD OF LEAST SQUARES 

By H. T. Davis 

1. Introduction. In an earlier article in the Annals of Math- 
ematics the author in collaboration with V. V. Latshaw published 
formulas and tables for the fitting of polynomials to data by the 
method of least squares, 1 In that paper two ranges of the inde- 
pendent variable were considered, one from x~l to x=p , 
and the other from x,=-p to x.* p . For the first range formulas 
were given for fitting polynomials of first, second and third de- 
grees to data and these formulas were reduced to tables. For the 
second range formulas were given for polynomials from the first 
to the seventh degrees, but these formulas were not then reduced 
to tables. 

It is the purpose of the present paper to supply the tables 
for the second range and hence to furnish a means of reducing 
to a minimum the numerical labor involved in fitting to data poly- 
nomials from the first to the seventh degree inclusive. Incidentally 
some novel mathematical aspects of the problem of polynomial 
approximation have been brought to light, particularly as it ap- 
plies to the existence of a set of polynomials which are orthogonal 
for a summation over discrete intervals. 

The tables have been computed in the statistical laboratory of 
Indiana University and have been checked by duplicate calcula- 
tion. The computation has been made possible by grants of funds 
by the Waterman Institute of the University. The author is par- 
ticularly indebted to Dr. V. V. Latshaw, Miss Irene Price, Byron 
Shelley, George Davis, and Miss Anna Lescisin for the work 
which they have done in connection with the various computations 
of this paper. 


’Volume 31 (1930), pp. 52-78. 
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l. Formulas . Let us first consider the data to be given as 
a set of equally spaced items ; 


y 

i 

% 

y? 

yj ■ 

y'm 

X i 

5 


x s ••• 

■ • -*777 


in which we assume that the difference is constant. 

If m is an odd number, m = £p+l > we select zero as the 
center of the ^ -range and without loss of generality we replace 
the table just given by the following; 


y 

y* 


• y-r 

y<? 

y ■ 

‘ Xa-7 

y* 

* 

~P 

-p+i J • • 

• -I 

a 

i • ■ 

■ * 

P 


Let us designate by M r the moments, 

(1) M r = £ <s r y s , w. 

•S--/0 

If m is an even number, m-^Fp , we must make a slight 
change in the notation and consider the distribution, 


y 

y~p * * 

y-z 

y-7 

y, 

y> 


X 



-l/2 

j/y 

3/2 •• 

■ ■ fto-Jj/2 


The n-tf? moments, M' r , will be correspondingly equal to, 

m; - nr£ -v r [ X? * r-/j r y- s } . 

’ <5 -J. 

The method of least squares is then employed as described 
in the previous paper to determine the coefficients of the poly- 
nomial, 

(2) y = + a z x £ * + a r „x™ 7r>=J,Z 7. 

It will be unnecessary to repeat the explicit formulas obtained 
since they have been given m the previous paper, but it will be 
useful in explanation of the notation of the tables to give the fol- 
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lowing determination of the coefficients as linear functions of 
the moments ; 2 


7 he straight tine, 




(3) ‘ 7Syf> ' AM ° 


y= 

, <3L 


3f h- 


1 P( r P+*)fJ/o+l) 
7 'he parabola , y - ^ ^ cz r oc -/• x* 


a'm , . 


a 0 *4M 0 + 3M;, 

(4) ^ determined from (3), 

<2* = BM 0 + CMj , 


J. The cubic, y = cz 0 +a y x + x 3 . 


a, 

(5) o', and < 2 , determined from (4). 

ct 3 ^3'M^C‘Mj, 


4. The qua) tic, y* a a + a,x +■ f 


a 0 *AM 0 +£MjjCM 4 , 

(6) =BM a i-DMj + 5M+, a. and a 3 determined from (S). 


5. The quintic, y*<a 0 +a / x+a J ,x s +a 3 x 3 +G!yc‘%a s x s 

(7) aj-B'Mf+B'Mj + B 7‘g., c^^and a 4 determined from (6). 
Os-C'M, + £‘%+F'M Si 


6. The sextic, y* a 0 ->-a J x+( 2 £X‘ s +ajX, 3 +a 4 x'*+ ajX^agX 6 


-The nutation follows that of the previous paper. It should be noted 
that the coefficients -4, 3 , C , etc. for the straight line, the parabola, 
the cjuaitiL, and the scxtic, and the coefficients A\ 3' } C' } etc. for the 
cubic, the quintic, and the septic are all given by different formulas, but 
it is hoped that the omission of subscripts denoting the degrees of the poly- 
nomials will lead to no confusion. 
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a o *AM 0 +3M 4 +CM4 + DM 6 , 

a z *£M a +£/% + r/% +GM 6 , 

(8) a /' a j and a s determined 

from (7). 

a 6 ~DM 0 + GM*+IM++JM 6 , 

7. The septic, y* a 0 +a*x:*+a s x* +€2^x* 
OfX s +cr4<t 6 +a r >!: 7 > 

<z f . A H +* + C'/% +D'M r> 

<%* 3ty+£'M s +f'M <T +&‘M r , a 0f cZg , 

(9) nnd a 6 determined 

a s * C*M f +F'Mj+ H'Mj+I'Mr, from (8). 

3. Orthogonal Polynomials , In a paper the significance nt 
which has perhaps never been fully appreciated, J. P. Gram in 
vestigated the problem of polynomial approximation over disci etc 
intervals by means of orthogonal polynomials. 3 This method ha 6 ' 
since been more fully investigated by Edward Condon 4 and his 
work was made the basis of a method for obtaining least squares 
polynomials by R. T. Birge and J. D. Shea.® The work of the 
latter, however, while effecting a simplification, does 'not reduce 
the problem to its simplest form. 

In a recent paper issued by the Hungarian National Commit- 
tee "on Economic Statistics, Karl Jordan has employed orthogonal 
functions in connection with binomial moments and has very 

Hlber die Entwickelung reeller Functionen in Feihen mittelst der 
Methode der kleinsten Quadrate. Journal fiir Math., vol. 94 (1883), pp. 
41-73. 

*The Rapid Fitting of a Certain Class of Empirical Formulae by the 
Method of Least Squares. Univ. of California Publications in Mathematics, 
vol. 2 (1927), pp. 55-66. 

5 A Rapid Method for Calculating the Least Square Solution of a 
Polynomial of any Degree. Ibid., pp. 67-118. 
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greatly simplified the numerical work of curve fitting. 11 The poly- 
nomials which he employs, however, appear in the form, 

y* <z 0 + +&* xfo-V + Gj * )x-lXx-£)+ , 

although in the final result they are numerically equivalent to the 
polynomials of the present paper. 

Let us begin with a set of polynomials, 

0 O X)> t f pe), £,Pc), 4 m Pc), 

of degrees 0,1^,^ m respectively such that, 

Z d fej 0. fe)=* O, for &£t m 


Assuming the existence of such a set of polynomials we can 
approximate by means of them a function ffc) which is defined 
over the set of integers from -p to p , 

Writing the approximation equation, 


( 10) fpc) = 4 fc) + 4, Oc)+— + 4 m pc), 

we multiply by 4 > r (%) and sum from -p to p . .We then obtain, 

* *> - 

(11) 22 f fe) pc) = P ~ , where 6--22 & Pc). 

~P f ~P 

If we represent the polynomial by the series, 


u See Berechnung der Trendhnie auf Grund der Theone der kleinstcn 
Quadrate, Budapest (1930) and Praktische Anwendung der Trendberech- 
nungs-Methode von Jordan, by A, Sipos, Budapest (1930), 
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it is clear from the definition of the moments (1) that we get 
from (10) the evaluation, 

( 12 ) 

U x / 

That these coefficients are identifiable with those explicitly 
given in equations (3) to (9) is a consequence of the following 
consideration: 

Let us approximate ffa) by minimizing the following sum: 
p 

which is equivalent in its result to the somewhat different method 
employed in the actual determination of the formulas (3) to (9), 
Taking the derivative of J with respect to <4^ and equating 
the result to zero we get, 

\ffa) m fc) r*J\ - & > 

whence, recalling the orthogonality of ihe puhnoinMjs, 

( 13 ) 4 . - P ffr) C*)/jr - 

We thus see that the ratios ^ /S r can be written down ex- 
plicitly by comparing them with the corresponding coefficients of 
M s in equations (3) to (9), 

In particular, if , we find the coefficients of the poly- 
nomial by equating the right member of (13) with the 

corresponding last row in the formulas (3) to (9). For example, 




f(k)-A Q 4 fa) - 


A) ^7 fa) 


-I 
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t£ r*S , we have, 


ch/f+eK ^ H 


j> 


Ilencc we get, 


4-^3 


^9 ~ {*£* 3 


J3y means of this identification we obtain as the hist seven 
polynomials the following : T 

4 , (*)~ i/( ( 2 p+i) ‘A , 

/x)= 3x,/p{ptl)f /? p+jj* ,4 

\p(p+l)(4 p*-lX2p+3) / 1 * 2 ' Pf 3 ) = C;A+B, 


*> *>• 


4pfp2-J)f4. P * })(4f2-9X2p*5Xp^A ' 

r,* 3 p(p z :ll(pj£l ) , Fx % £ K * + c, 

17 J5 ) 

l 


, , , r JV.^. j. 

4 Ma \4p(p*-lX4p z -D(4p*- 9)(p z Aj(2pi-5)(pt3) J 




j S(2p‘ i +2p-3)X CtSp^30p 3 ~ 35 p K 5QpPZ k 


9 


63 


} 




~ F'xf+EW+Cx, 
r _ A z JXJJ1JA- 


o+7 j 


\4p(p*-lXp z -4XA P z -9X4p 2 '2SXp'3X2p* 

[ b Sfcpt+Sp-rJA* 6Sp 4 '+J 0p } -£0p 1! -2Sp+Mjx z 

r ' ‘ u " n 

Sp7p*-l)Cp J: - 4Xp>-3) 1 .JxKlx\&x z +Z>, 

3 7J1 I 


r 3 j^ 2 13 ^ ) 

~ ? . i ~ ~ a yr\y\ _ 9 y s\JT ^ JS\4 ).,4 


1 


T - 7(3p A +3p-dO)x s 7fi5/A+30p 3 - 9Qp*-IO5p+i0J)x.? 

73 + JT3J~ 


x' '~^~ i ~ 7 X l ~ , 

- 35 p 6 + 105 p b - 286 p 4 - 735 p 3 +-4 9 Tp z + 882 p - 18 Q)x j 

4> 2 

In order to effect the computation of the sum 3^ =j£ 


7 See note at end of this set non 
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we replace the value oM w as given by (13) in (10) and compare 
the coefficient of a;' 77 with the corresponding coefficient in the 
proper formula of the set from (3) to (9). Thus for since 
- F*x s + 7* C'x, we have from the quintic 

approximation, 

fM* x s fFMf + £ M a + C J Mj) -h terms of lower degree, 

. r^*f**% t **"* I”*) , 

- r * ( " ‘ gr 1 1 * ~ — y + terms of lower degree, 

(F' z w 5 r'e'rtz F'c'M^ s 

J — y. - — * + — ~ L y. terms of lower degree, 

1 °<5 / 

Equating the coefficients of it is clear that * F\ 

4. The Recursion Formula . It will be obvious from the pre- 
ceding discussion that the polynomials which we have investigated 
are essentially the analogue of the well-known Legendre poly- 
nomials, where the integration between the limits -1 and + 1 
used in the definition of the latter's orthogonality is here replaced 
by the discrete sum over the integers from -p to +p . We might, 
therefore, expect to find a recursion formula connecting any suc- 
cessive three of the new polynomials similar to the recursion 
formula which exists for the Legendre case. It turns out that 
this expectation is justified and we find the following relationship 

holding between &) * and fe) : 

f n+lj^fep-r)) f 2/3 + 17+2) ^ 

( 14 ) - 4Y'2n+j)fpr?+<})x <) n Ox) 

+ 4/'2n + j)f2r?+3) M) - O 

From this equation we easily deduce that the coefficient, 
of^^^in related to the coefficient, ^ , of x 77 in 
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^ /^as follows : 

From this we obtain by iteration and a proper change in no- 
tation, the following value for <fi n ; 

f* vwj*. ■ }/{fnO^&)- 

(15) / 

^ '4p*4jftp2gJ . . fr?-f r? ' 

The value thus obtained is at once seen to be equal to 

<-5/7 -■£ r-x). 

~P 

As an example consider the case where r?=£. We then have, 
4W C*xteCOx*+6*m a*{x«-Pofp+JJx* +p*fa +J) */ 9 J > 

where we abbreviate, at* = 3 Yp +lXf4p‘ 8 -ljYZp+3j z 

We then obtain, 

Z a* [L^c^-Zpfp+ljZ j\pYpYXtfp+l)/$ J , 

- c^V^P+lXZp+lXlp^+Jp-jO -3pYp+lJ 3 Z?pi-l) 

+pYp+V*fep+i)/ 9 }, 3 

= <z*{pCp+lX4p*-V&p+3j/‘45}* 3*5/ P rp+Wptyft»J) 
which is seen to agree with the value of as calculated directly 
from (15). 8 

' s The dehnition of the which we have given above was c(iosen 

for the obvious connection which the functions in that form have with the 
problem of curve fitting and with the computed values in the tables, If, 
however, the coefficient of x, n were reduced to unity, 
dtp**, 6 l ('x;)=x.'*-pCp +JJ J 3, 
etc., then the recursion formula (14) would have been, 

4(+n*'l)# nt .j(k)-4C4n*-i)&4 n O6)+n*fejo*Tt+X)C2p+ni‘l)0 n ^ i fe)*O 

If, moreover, ^ fe) as just defined were multiplied by the coefficient 
J 35... (2 >?-l j/ 7?/ , which is the multiplier of the corresponding 
Legendre polynomials, then the recursion formula becomes, 

frj + r>6fp ’»+Ufep+”+V^/*0*O 
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we replace the value of A) m as given by (13) in (10) and compare 
the coefficient of X ^ with the corresponding coefficient in the 
proper formula of the set from (3) to (9). Thus for m=5, since 
= F'x'** F'x s + C'x, we have from the quintic 
approximation, 


f(a)= x s fF /% y- F M s + /■ terms of lower degree, 


= F‘x 





■+ terms of lower degree, 


fF> Z M S r>€‘M 3 r'c'M^ s 

*( — r — ~ + r * + — r — U + terms of lower degree, 

^ / 

Equating the coefficients of it is clear that 
4. The Recursion Formula. It will be obvious from the pre- 
ceding discussion that the polynomials which we have investigated 
are essentially the analogue of the well-known Legendre poly- 
nomials, where the integration between the limits -1 and-^Z 
used in the definition of the latter's orthogonality is here replaced 
by the discrete sum over the integers from -p to +p , We might, 
therefore, expect to find a recursion formula connecting any suc- 
cessive three of the new polynomials similar to the recursion 
formula which exists for the Legendre case. It turns out that 
this expectation is justified and we find the following relationship 

holding between J and : 

( r)+t) z {2p-r)J { Fp + n+ 2 ) 

( 14 ) - fx) 

From this equation we easily deduce that the coefficient,^./, 
of^^^in fc ) is related to the coefficient, ^ , of x” in 
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follows: 

4>+/ = frt+Jj^fcp-pJfep+p+z) 

From this we obtain by iteration and a proper change in no- 
tation, the following value for <fi n 

6, = }k(nO*tp#P*-V- 

( 15 ) / 

f4/>ur4tA9j.. . 

The value thus obtained is at once seen to be equal to 

0 

Jp *27 rx). 

-jO 

As an example consider the case where We then have, 

C*x%ZC&s*+6*m a *{x+jpf p +l)x* +pYp+J)*/9} t 

where we abbreviate, a* = 3 ^-J^/pYp jY3p-f-3) z 

We then obtain, 

P P P p 

£ $ z Cx)*a * [Ex ^-2p(p J +pYp+lJ*{2p+l)/9 J, 

= cY i \^p+iypp+jj0pY3p-£) - ^p Yp+iyfep+i) 

■<- pYp+V*fep+lj/ 9 }, ^ 

- {pCp+lX4p*UrJp+3j/4Sl* 3?s/pfp+Wp*-Jjfr>*3) 

which is seen to agree with the value of Sg as calculated directly 
from (15). 8 

s The definition of the which we have given above was chosen 

for the obvious connection which the functions in that form have with the 
problem of curve fitting and with the computed values in the tables. If, 
however, the coefficient of x, n were reduced to unity, 
dM**, jjMexyt-pfp+JJ jj, 
etc., then the recursion formula (14) would have been, 

4(4n^l)<A n ^j (k)-4fan*~X)se <fi n <k)+rt*&p~ O 

If, moreover, ^ fc) as just defined were multiplied by the coefficient 
1<3 5 .* t (Zp-lJ/r? / , which is the multiplier of the corresponding 
Legendre polynomials, then the recursion formula becomes, 

4/h+t)4n+j fe) - 4 
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5. The Polynomials of Gram. It will be at once evident that 
the results obtained above permit us to define a new set of poly- 
nomials which are orthogonal over the discrete range from 
x = S to x = 

In the former paper in the Annals it was proved that the 
formulas for the coefficients of the least square polynomial, 

y- A f ^ + * m > 

fitted to data given over the discrete range to can 

be obtained from the coefficients, a <>J a fJ of equa- 

tion (2), by means of the following substitution: 


W - 'V7 - 
r r 




rfr-i) 
2/ I 



where M r arc the moments defined by (1) and rr? r are the 
moments, 

P' 

m r cr%. 

Conversely we can pass from the range to -v -/0 

to the range to , by means of the substitution: 


m r 


•M r + 






/? = 2p+J, 


a 'bjZJt 

2 / 




Mr- 2' 



Replacing M r and byx r , it is clear that new polynomials 
are obtained which belong to the range 1 to . 

The polynomials may be explicitly evaluated from 
+ - + <fi rn x rrv 
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as follows; 

fa) - 4 * A ' AJ fa*- ^ t** 1 +&*)+ — 

y- J77 ~ mf>x r77 ~ J + mf'm-JtJf?** . . .j t 

where b^fp-lj/z and An denotes the value of An after 
the substitution p^fp-ljfe. 

These polynomials can be proved by the method of section 3 
to be orthogonal over the discrete range ^®/to -tf»//and they are 
identifiable with the last lines of the formulas (3), (4), and (5) 
of the Annals paper previously cited where the ?r? r are replaced 
by x r , 

Polynomials orthogonal over the disci ete range x*Oto xm-J 
were first obtained by Gram in the paper to which we previously 
refen eel and hence (16) maj properly lie called the Gram poly- 
nomial of rn tf? degree , 

The following explicit formula, in the notation of the present 
paper where the range is from 1 K.ox= p , was derhed by 
Gram : 

)fr f*) A 1 frn+l)(p-Z)l(£~l) 

m \fp-m-l) 1 1 1 rr ? 7 f m-l)!'!** 

{ 77 ? +1 ){m+3)fp~ 3)! 
f/77- 

fir? +1J( m+Z/rrH- 3)fp - 4)/f g - t)fx-2)fx - 3) + 
fm- 3) yj/ * 

Since the coefficient of x'" m equals ^ 

since the 
. - f?rr?) 

coefficient of x m in Gram's definition is (-IJ ( tk+Isitit+JJ' ~ j > 
it is clear that die following equation holds between fy) and 




166 


POLYNOMIAL APPROXIMATION 


>7?+jJ >77 /} fcj, 

-\f-J) m p('p~lJfP' s -4)- ■ -fp 3 - m Vfm.0/x KvJ/f2io?+Jjj ftP-tX 
By methods previously used it can be shown that, 

P 

z%;rx>.tYpj. 


The first four Gram polynomials are given below explicitly: 0 

%(*) =/ A 

l///x)= \lzjpf P X)\ \_x-(p+V/j _ ) , 

t/x Yso/prp^Xp^ \x Up+l)x + fp+l)fp+xl6\ , 

%&)* \z8ooipr P *jXp*4/ P 4 9 J] \x*-sg>+jj f + 

Aft 5p 2 >-15p-f-lljx ~ (p+lj(p+£)(p+s) j 40^ . 

6. Tables and Numerical Application, In tables 1 to 7 the 
numerical values of the coefficients of equations (3) to (9) have 
been tabulated for values of p by half integers. For the case of 
the straight line the range of p is from 0.5 to 100.0; for the 
parabola the range is from 1.0 to 100.0; for the cubic the range 
is from 1.5 to 50.0; for the other polynomials the range does not 
exceed p~ 25 0 The tables have been computed to ten signifi- 
cant figures and have been checked by duplicate calculation. 

fi These polynomials are essentially the same as those employed by 
Jordan (loc. tit.) except that the summation in his work has taken over 

the numbers 0, 1, 2 , His polynomials are also expressed 

in teims of the Newton polynomials; 'fa-nj. 
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In illustration of the application of these tables to the numer- 
ical problem of polynomial approximation, we shall fit polynomials 
to the data employed by Karl Pearson in the same connection, his 
method being, however, the method of moments. 10 The data are 
from T. N. Thiele 11 and consist of a system of fiequencies ob- 
tained from a game of patience (solitaire) : 


Value of character 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

F requency 

0 

3 

7 

35 

101 

89 

94 

70 

46 

30 

15 

4 

5 

1 

0 

Class marks (x>) 

B 

H 

i 

D 

-3 

“2 

-1 

0 

1 

2 

3 

4 

5 

6 

_7 


Computing the moments, using the values of x for this pur- 
pose, we obtain the following: 


M 0 - 500; = ~J70; M^2728; /%^5508; 

34/08; /% *-76880; 626188, /% - -1419788. 

These values are then substituted in equations (3) to (9) 
and the coefficients of the desned polynomials thus obtained. In 
illustration we shall give only the computations foi the parabola. 

From the value corresponding to p^7 in column A* of 
table 1 we obtain, 

ct f = M f .&j357 1428571* -2 035 714; 

Similarly fiom the values corresponding to p-7 in columns 
A f 3 9 and C of table 2 we compute, 

. 1511312217'/% -82)452 4886 878 * 63221 719 \ 

452 4886878 + /% • 0)242 4046 542-16011635 

10 On the Systematic Fitting of Curves to Observations and Measure- 
ments Biometrika, vol, 1 (1902), pp. 265-303; vol 2 (1903), pp, 1-27, in 
particular, p. 18. 

il Forelaesninger over Almindehg Iagttagclscslaerc, Copenhagen, (1889), 

p. 12 
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Proceeding m this manner the other coefficients are easily 
computed and we obtain the following seven polynomials of ap- 
proximation : 

y- 33 333333 - 2 033714 x, 
y* 63 221719-2. 035714 x - 1. 601163 x*, 
y . 63. 221 719- 9 924624 X - /. 601163 x ^ . 236195 x f 
y= 75 058367- 9 924624 x-3.760517x*x236195x J 4- 045666 x* 
y-. 75056367-20. 40589 Ox -3 76051 7x*+l 13p793x*+ 0456662 * 
- 014922 x* 

y - 73 950386 -20 40 5890 x - 3.320566 x z + 1 139793 x J 
+. 020390 x 014922 x s +. 000338551 x f 
y - 73. 950386 - 25. 763034x -3 320586 x*+. 2 070322 x 3 
+. 020990 x*-. 0543 16 X s -/-. 000336551 x 6 x.0004607106x 7 . 

The following table contains the values computed from these 
polynomials over the range from x =-J to x = 7 \ 


HI 

ni 

f?* 1 

n * 2 

rt* 3 

77 * 4 

77 * 5 

77 s 6 

«--7 

~7 

0 

47 583 

- 985 

-26 778 

-11.105 

3.123 

3.916 

.591 

-6 

3 

45 548 

17.794 

14.109 

7,593 

- 8.863 

- 10,448 

- 3,483 

-5 

7 

43.512 

33 371 

43.291 

29 685 

15.773 

15,469 

12.432 

-4 

35 

41 476 

45.746 

62.185 

51.163 

SO. 538 

51.513 

45 975 

-3 

pM 

39.440 

54.918 

72 208 

68.309 

78 982 

80.073 

79.537 

-2 

89 

37 405 

60 888 

74.777 

78 707 

92.918 

93.194 

97 660 

- 1 

94 

35.369 

63.656 

71.309 

81.032 

90 625 

89.932 

94 397 

0 

Mil 

33.333 

63,222 

63.222 

75 058 

75.058 

73.950 

73 950 

1 

46 1 

31.298 

59 585 

51.932 

61.655 

52 062 

51.370 

46.905 

2 

m 

29,262 

52,746 

38 857 

42,787 

28.576 

28.853 

24.388 

3 

15 

27.226 

42,704 

25.415 

21.516 

10.843 

11.935 

12.471 

4 

4 

25.190 

29.460 

13 021 

1 999 

2.624 

3.599 

9.136 

5 

5 

23,155 

13,014 

3 094 

- 10.512 

3 400 

3 095 

6 133 

6 

1 

21.119 

-6 634 

-2 950 

* 9.667 

6 589 

5 005 

- 1 959 

7 

HD 

19 083 

-29,485 

- 33.693 

11 980 

- 2,249 

- 1.457 

.869 


The approximation attained by these polynomials is exhibited 
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in the following chaits, the odd order cases hcing given in figure 
1 and the even order cases in figure 2. 


In oi dei to illustrate the case wheic the nunibei of items is 
even we shall delete the last value from the senes which we have 
just used. The table must then be arranged as follows- 


Frequency 

O 

3 

7 

35 IOI 89 

94 

70 

Class maik 

-13/3 

-11/3 

-9/2 

-7/3 -5/2 -3/2 

-J£_ 

J/2 

Frequency 

16 

30 

JO 

4 5 1 



Class matk 

0/2 

5/2 

7/2 

9/2 11/2 13/2 




The method will be sufficiently illusttated by means of the 
fit st, second, and fifth degiee polynomials To compute these we 
fust obtain the moments. 


M a = 500, 07, *-320, 01^*2263, Af, *-1781, 

/%. = 36930. 35, Mj. *- 1633 5. 

In order to evaluate the coefficients of the parabola we use 
the value corresponding to /a = 6.5 in column 6 ' of table 1 and 
the values in columns 5,3, and C of table 2 as follows 
14 , ; 53J 139 3601 306 = - J. 406 393 

a g */% 163 1093 330-/% 72 J 558 0137 143 =68 31 1 734, 
a^-r% (3) 338 0357143 c AT, {'ll 343 406 5 934 *-2 006181 

The other coefficients aic similarly obtained and we thus de- 
rive the following appi oximating polynomials : 

y* 33 714336-1 106593 x, 
y* 63 314734 -1. 106593k -3 006181 x f 
y--8J 37350-16 631 50x -5 172034 x *+11 19331 x 077082x 4 
-017301x1 
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The approximating values obtained from the parabola and the 
quintic are recorded in the following table : 



y 

r> - x? 


Xr 

X 

/7-^ 

*?~J5 

-6.5 

0 

- 7.304 

1 492 

0 5 

70 

67.110 

73.912 

-5.5 

3 

15.364 

- 12 686 

1 1.5 

46 

61 691 

50 922 

-4.5 

7 

34 019 

13.214 

2.5 

30 

52 260 

28.698 

-3.5 

35 

48,662 

49.869 

3.5 

15 

38.816 

13.295 

-2.5 

101 i 

59.293 

79.419 

4.5 ! 

4 ! 

21.360 

7.280 

-1.5 

89 

65.9U 

93 329 

5.5 

5 

-.108 

7.592 

-0.5 

94 

68.516 

90.257 

il 6.5 

1 

- 25 . 589 

3 ,407 


In the article from which these data are taken, Karl Pearson 
compares the efficacy of polynomial curves with that of skew- 
frequency curves and shows the superiority of the latter in the 
present case. It is worth noting here, however, that the least square 
polynomials of the present paper give a measurably better fit than 
the moment polynomials employed by Pearson. The sum of the 
squares of the deviations from the data of the values obtained by 
means of the sixth degree parabola of Pearson is found to be 
1402.31 ; the same sum for the sextic of the present papeir is 
1091.22, For the septic the sum of the square of the deviations 
is 926.32, which compares not too unfavorably with the sum 760,91 
obtained from the skew-frequency curve. 
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TABLE I 


(The numbers m parentheses denote the number of ciphers 
between the decimal point and the first significant figme.) 


JO 

A 

A ' 

0 5 

.500 0000 Q 00 

2.000 0000 000 

1.0 

.333 3333 333 

.500 0000 000 

1.5 

,250 0000 000 

.200 0000 000 

2.0 

.200 0000 000 

.100 0000 000 

2.5 

166 6666 667 

.( 1)571 4285 714 

3.0 

142 8571 429 

.( 1)357 1428 571 

3.5 

.125 0000 000 

.( 1)238 0952 381 

4 0 

in mi m 

, ( 1)166 6666 667 

4.5 

.100 0000 000 

. ( 1)121 2121 212 

5.0 

.( 1)909 0909 091 

( 2)909 0909 091 

5.5 

. ( 1)833 3333 333 

.( 2)699 3006 993 

6 0 

.( 1)769 2307 692 

( 2)549 4505 495 

6.5 

.( 1)714 2857 143 

.( 2)439 5604 396 

7 0 

( 1)666 6666 667 

( 2)357 1428 571 

7.5 

.( 1)625 0000 000 

.( 2)294 1176 471 

8,0 

.( 1)588 2352 941 

( 2)245 0980 392 

8 5 

.( 1)555 5555 556 

.( 2)206 3983 488 

9,0 

.( 1)526 3157 895 

.( 2)175 4385 965 

9.5 

.( 1)500 0000 000 

.( 2)150 3759 398 

10.0 

,( 1)476 1904 762 

1 .( 2)1298701299 

10.5 

.( 1)454 5454 545 

.( 2)112 9305 477 

11.0 

,( 1)434 7826 087 

( 3)988 1422 925 

11 5 

.( 1)416 6666 667 

.( 3)869 5652 174 

12 0 

.( 1)400 0000 000 

.( 3)769 2307 692 

12.5 

! .( 1)384 6153 846 

. ( 3)683 7606 838 

13.0 

.( 1)370 3703 704 

.( 3)610 5006 105 

13.5 

.( 1)357 1428 571 

.( 3)547 3453 749 

14.0 

.( 1)344 8275 862 

.( 3)492 6108 374 

14.5 | 

.( 1)333 3333 333 

.( 3)444 9388 209 

15 0 

.( 1)322 5806 452 

( 3)403 2258 064 

15.5 

.( 1)312 5000 000 

.( 3)366 5689 150 

16.0 

,( 1)303 0303 030 

, ( 3)334 2245 989 

16 5 

.( 1)294 1176 471 

.( 3)305 5767 762 

17 0 

.( 1)285 7142 857 

( 3)280 1120 448 

17.5 

.( 1)277 7777 778 

.( 3)257 4002 574 

18.0 

.( 1)270 2702 703 

( 3)237 0791 844 

18.5 

.( 1)263 1578 947 

. ( 3)218 8423 241 

19.0 

1 .( 1)2564102564 

. ( 3)202 4291 498 

19.5 

.( 1)250 0000 000 

.( 3)187 6172 608 

20.0 

. ( 1)243 9024 390 

.( 3)174 2160 279 

20 5 

( 1)238 0952 381 

.( 3)162 0614 213 

21 0 

. ( 1)232 5581 395 

.( 3)151 0117 789 

21,5 

. ( 1)227 2727 273 

,( 3)140 9443 270 

22.0 

. ( 1)222 2222 222 

.( 3)131 7523 057 

22 5 

.( 1)217 3913 043 

.( 3)123 3425 S 40 

23.0 

.( 1)212 7659 574 

( 3)115 6336 725 

23,5 

.( 1)208 3333 333 

.( 3)108 5540 599 

24 0 

.( 1)204 0816 327 

( 3)102 0408 163 

24.5 

. ( 1)200 0000 000 

. ( 4)960 3841 537 

25,0 

.( 1)196 0784 314 

.( 4)904 9773 756 
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TABLE I — ( Continued) 


/O 



25.5 

.(1)192 3076 923 

. (4)853 7522 411 

26.0 

.(1)188 6792 453 

.(4)806 3215 610 

26.5 

.(1)185 1851 852 

.(4)762 3403 850 

27.0 

.(1)181 8181 818 

.(4)721 5007 215 

27.5 

.(1)178 5714 286 

. (4)683 5269 993 

28,0 

.(1)175 4385 965 

. (4)648 1721 545 

28.5 

.(1)172 4137 931 

. (4)615 2142 484 

29.0 

.(1)169 4915 254 1 

. (4)584 4535 359 

29.5 

,(1)166 6666 667 

.(4)555 7099 194 

30.0 

.(1)163 9344 262 

.(4)528 8207 298 

30.5 

.(1)161 2903 226 

.(4)503 6387 903 

31.0 

.(1)158 7301 587 

. (4)480 0307 220 

31 5 

.(1)156 2500 000 

.(4)457 8754 579 

32 0 

.(1)153 8461 538 

.(4)437 0629 371 

32 5 

.(1)151 5151 515 

.(4)417 4929 548 

33.0 

.(1)149 2537 313 

.(4)399 0741 480 

33 5 

.(1)147 0588 235 

. (4)381 7230 981 

34 0 

.(1)144 9275 362 

.(4)365 3635 367 

34.5 

.(1)142 8571 429 

. (4)349 9256 408 

35 0 

.(1)140 8450 704 

.(4)335 3454 058 

35.5 

.(1)138 8888 889 

.(4)321 5640 877 

36 0 

.(1)136 9863 014 

,(4)308 5277 058 

36 5 

.(1)135 1351 351 

.(4)296 1865 976 

37 0 

.(1)133 3333 333 

.(4)284 4950 213 

37 5 

.(1)131 5789 474 

.(4)273 4107 997 

38 0 

.(1)129 8701 299 

.(4)262 8949 997 

38.5 

.(1)128 2051 282 

.(4)252 9116 453 

39.0 

.(1)126 5822 785 

.(4)243 4274 586 

39 5 

.(1)125 0000 000 

.(4)234 4116 268 

40.0 

.(1)123 4567 901 

.(4)225 8355 917 

40 5 

.(1)1219512 195 

.(4)217 6728 595 

41 0 

.(1)120 4819 277 

.(4)209 8988 288 

41 5 

.(1)119 0476 190 

.(4)202 4906 348 

42,0 

.(1)117 6470 588 

.(4)195 4270 080 

42 5 

.(1)116 2790 698 

.(4)188 6881 457 

43.0 

(1)114 9425 287 

.(4)182 2555 952 

43 5 

.(1)113 6363 636 

.(4)176 1121 482 

44 0 

.(1)112 3595 506 

.(4)170 2417 433 

44 5 

. (Din mi in 

.(4)164 6293 781 

45 0 

.(1)109 8901 099 

. (4) 159 2610 288 

45 5 

.(1)103 6956 522 

.(4)154 1235 763 

46.0 

.(1)107 5268 817 

.(4)149 2047 387 

46 5 

.(1)106 3829 787 

.(4)144 4930 102 

47.0 

.(1)105 2631 579 

, (4) 139 9776 036 

47 5 

.(1)104 1666 667 

.(4)135 6483 993 

48.0 

.(1)103 0927 835 

. (4) 131 4958 973 

48.5 

.(1)102 0408 163 

.(4)127 5111 732 

49.0 

.(1)101 0101 010 

.(4)123 6858 380 

49.5 

.(1)100 0000 000 

.(4)120 0120 012 

50 0 

. (2)990 0990 099 

•(4)116 4822 365 
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TABEE I— (Continued) 


yO 

A 

A' 

50.5 

.(2)980 3921 569 

.(4)113 0895 500 

51.0 

.(2)970 8737 864 

.(4)109 8273 514 

51.5 

.(2)961 5384 615 

.(4)106 6894 271 

52 0 

. (2)952 3809 524 

.(4)103 6699 150 

52 5 

.(2)943 3962 264 

.(4)100 7632 819 

53.0 

. ( 2)934 5794 393 

. (5)979 6430 181 

53.5 

.(2)925 9259 259 

.(5)952 6803 662 

54 0 

,(2)917 4311 927 

.(5)926 6981 744 

54 5 

. (2)909 0909 091 

.(5)901 6522 778 

55.0 

. (2)900 9009 009 

.(5)877 5008 775 

55.5 

.(2)892 8571 429 

.(5)854 2043 940 

56.0 

.(2)884 9557 522 

. (5)831 7253 310 

56 5 

.(2)877 1929 825 

.(5)810 0281 485 

57 0 

.(2)869 5652 174 

.(5)789 0791 446 

57.5 

.(2)862 0689 655 

.(5)768 8463 461 

58 0 

.(2)854 7008 547 

.(5)749 2994 051 

58.5 

.(2)847 4576 271 

' .(5)7304095041 

59.0 

.(2)840 3361 345 

.(5)712 1492 665 

59.5 

.(2)833 3333 333 

.(5)694 4926 731 

60.0 

.(2)8264462810 

.(5)677 4149 844 

60.5 

.(2)819 6721 311 

.(5)660 8926 677 

61.0 

(2)813 0081 301 

.(5)644 9033 290 

61.5 

.(2)806 4516 129 

.(5)629 4256 491 

62.0 

.(2)800 0000 000 

.(5)614 4393 241 

62.5 

. (2)793 6507 937 

.(5)599 9250 094 

63.0 

.(2)787 4015 748 

.(5)585 8642 670 

63 5 

,(2)781 2500 000 

.(5)572 2395 166 

64 0 

.(2)775 1937 984 

. (5)559 0339 893 

64 5 

.(2)769 2307 692 

.(5)546 2316 842 

65.0 

.(2)763 3587 786 

.(5)533 8173 277 

65 5 

.(2)757 5757 576 

.(5)521 7763 354 

66 0 

.(2)751 8796 992 

.(5)510 0947 756 

66 5 

.(2)746 2686 567 

.(5)498 7593 361 

67,0 

.(2)740 7407 407 

.(5)487 7572 920 

67.5 

.(2)735 2941 176 

.(5)477 0764 754 

68.0 

. (2)729 9270 073 

. (5)466 7052 476 

68 5 

. (2)724 6376 812 

.(5)456 6324 725 

69.0 

.(2)719 4244 604 

.(5)446 8474 910 

69.5 

. (2)714 2857 143 

(5)437 3400 975 

70.0 

.(2)709 2198 582 

. (5)428 1005 180 

70.5 

. (2)704 2253 521 

. (5)419 1193 883 

71.0 

.(2)699 3006 993 

.(5)410 3877 343 

71 5 

. (2)694 4444 444 

.(5)401 8969 536 

72.0 

. (2)689 6551 724 

.(5)393 6387 970 

72.5 

. (2)684 931.5 068 

.(5)385 6053 522 

73.0 

. (2)680 272 1 088 

.(5)377 7890 275 

73 5 

.(2)675 6756 757 

, (5)370 1825 370 

74.0 

.(2)671 1409 396 

. (5)362 7788 863 

74 5 

.(2)666 6666 667 

. (5)355 5713 587 

75.0 

.(2)662 2516 556 

.(5)348 5535 030 
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TAB EE I — (Continued) 


p 

A 

A' 

75.5 

.(2)657 8947 368 

.(5)341 7191 206 

76.0 

.(2)653 5947 712 

.(5) 335 0622 546 

76.5 

.(2)649 3506 494 

. (5)328 5771 787 

77.0 

.(2)645 1612 903 

.(5)322 2583 868 

77.5 

.(2)641 0256 410 

. (5)316 1005 832 

78.0 

.(2)636 9426 752 

.(5)310 0986 734 

78.5 

.(2)632 9113 924 

.(5)304 2477 550 

79.0 

.(2)628 9308 176 

.(5)298 5431 096 

79.5 

1 .(2)625 0000 000 

. (5)292 9801 945 

80.0 

.(2)621 1180 124 

.(5)287 5546 354 

80.5 

.(2)617 2839 506 

. (5)282 2622 188 

81.0 

. (2)613 4969 325 

.(5)277 0988 855 

81.5 

(2)609 7560 976 

.(5)272 0607 240 

82 0 

.(2)606 0606 061 

.(5)267 1439 639 

82.5 

. (2)602 4096 386 

. (5)262 3449 70S 

83.0 

.(2)598 8023 952 

.(5)257 6602 389 

83.5 

. (2)595 2380 952 

. (5)253 0863 885 

84.0 

.(2)591 7159 763 

.(5)248 6201 581 

84.5 

.(2)588 2352 941 

.(5)244 2584 010 

85.0 

.(2)584 7953 216 

.(5)239 9980 800 

85.5 

.(2)581 3953 488 

. (5)235 8362 636 

86.0 

.(2)578 0346 821 

.(5)231 7701 211 

86.5 

.(2)574 7126 437 

. (5)227 7969 190 

87.0 

.(2)571 4285 714 

.(5)223 9140 170 

87.5 

.(2)568 1818 182 

. (5)220 1188 642 

88.0 

.(2)564 9717 514 

. (5)216 4089 957 

88.5 

.(2)561 7977 528 

. (5)212 7820 293 

89.0 

.(2)558 6592 179 

.(5)209 2350 621 

89.5 

. (2)555 5SSS 556 

. (5)205 7676 677 

90.0 

.(2)552 4861 878 

. (5)202 3758 930 

90.5 

. (2)549 4505 495 

. (5)199 0582 554 

91.0 

.(2)546 4480 874 

.(5)195 8127 404 

91.5 

. (2)543 4782 609 

.(5)192 6373 986 

92.0 

.(2)540 5405 405 

.(5)189 5303 438 

92.5 

.(2)537 6344 086 

.(5)186 4897 SOI 

93.0 

.(2)534 7593 583 

. (5) 183 51-38 498 

93.5 

.(2)531 9148 936 

.(5)180 6009 315 

94.0 

.(2)529 1005 291 

.(5)177 7493 378 

94.5 

(2)526 3157 895 

.(5)174 9574 635 

95.0 

.(2)523 5602 094 

.(5)172 2237 531 

95.5 

.(2)520 8333 333 

. (5)169 5466 999 

96.0 

.(2)518 1347 150 

.(5)166 9248 438 

96.5 

.(2)515 4639 175 

.(5)164 3567 692 

97.0 

.(2)512 8205 128 

.(5)161 8411 044 

97,5 

.(2)510 2040 816 

. (5)159 3765 191 

98.0 

. (2)507 6142 132 

. (5)156 9617 233 

98.5 

.(2)505 0505 051 

.(5)154 5954 662 

99.0 

.(2)502 5125 628 

.(5)152 2765 342 

99.5 

.(2)500 0000 000 

.(5)150 0037 501 

100.0 

.(2)497 S124 378 

. (5) 147 7759 716 
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POLYNOMIAL APPROXIMATION 


TABLE II 


(The number in parentheses denote the number of ciphers between the 
decimal point and the first significant figure^) 


p 

A 

3 

c 

1.0 

1.000 0000 000 

- 1.000 0000 000 

1.500 0000 000 

1.5 

,640 6250 000 

- ,312 5000 000 

,250 0000 000 

2 0 

.485 7142 857 

- 142 8571 429 

.( 1)714 2857 143 

2,5 

.394 5312 500 

• ,( 1)781 2500 000 

.( 1)267 8571 429 

3,0 

.333 3333 333 

- ,( 1)476 1904 762 

.( 1)119 0476 190 

3 5 

.289 0625 000 

-.( 1)312 5000 000 

.( 2)595 2380 952 

4.0 

.255 4112 554 

-,( 1)216 4502 165 

.( 2)324 6753 247 

4.5 

.228 9062 500 

-,( 1)156 2500 000 

. ( 2) 189 3939 394 

5 0 

.207 4592 075 

- .( 1)116 5501 166 

.( 2)116 5501 166 

5.5 

,189 7321 429 

- . ( 2)892 8571 429 

.( 3)749 2507 493 

6.0 

.174 8251 748 

( 2)699 3006 993 

.(3 ) 499 5004 995 

6.5 

.162 1093 750 

-,( 2)558 0357 143 

.( 3)343 4065 934 

7,0 

.151 1312 217 

- , ( 2)452 4886 878 

.( 3)2424046542 

7.5 

.141 5550 595 

- . ( 2)372 0238 095 

.( 3)175 0700 280 

8.0 

.133 1269 350 

- .( 2)309 5975 232 

| ( 3) 128 9989 680 

8 5 

.125 6510 417 

- . ( 2)260 4166 667 1 

, ( 4)967 4922 601 

9 0 

.118 9739 054 

- . ( 2)221 1410 880 

.( 4)737 1369 601 

9.5 

.112 9734 848 

- . ( 2) 189 3939 394 

.( 4)569 6058 328 

10,0 

107 5514 874 

-.( 2)163 4521 085 

.( 4)445 7784 778 

10 5 

,102 6278 409 

- . ( 2)142 0454 545 

.( 4)352 9079 616 

11.0 

.( 1)981 3664 596 

- . ( 2) 124 2236 025 

.( 4)282 3263 693 

11.5 

.( 1)940 2316 434 

-.( 2)109 2657 343 

.( 4)228 0328 367 

12 0 

.( 1)902 4154 589 

- . ( 3)966 1835 749 

.( 4)185 8045 336 

12 5 

.( 1)867 5309 066 

-.( 3)858 5164 835 

( 4)152 6251 526 

13.0 

( 1)835 2490 421 

( 3)766 2835 249 

.( 4)126 3104 711 

13.5 

.( 1)805 2884 615 

-.( 3)686 8131 868 

. ( 4) 105 2587 260 

14 0 

.( 1)777 4069 954 

-.( 3)617 9705 846 

.( 5)882 8151 209 

14.5 

.( 1)751 3950 893 

-.( 3)558 0357 143 

.( 5)744 8752 582 

15,0 

.( 1)727 0704 824 

- (3 ) 505 6122 965 

.( 5)632 0153 706 

15.5 

,( 1>704 2738 971 

-.( 3)459 5588 235 

.( 5)539 0719 338 

16.0 

.( 1)682 8655 216 

- ( 3)418 9359 028 

.( 5)462 0616 575 

16.5 

.( 1)662 7221 201 

-.( 3)382 9656 863 

.( 5)397 8864 273 

17.0 

.( 1)643 7346 437 

-.( 3)351 0003 510 

.( 5)344 1179 912 

17 . S 

.( 1)625 8062 436 

- . ( 3)322 4974 200 

.( 5)298 8393 081 

18.0 

.( 1)608 8506 089 

- ( 3)297 0002 970 

.( 5)260 5265 763 

18 5 

.( 1)592 7905 702 

-.( 3)274 1228 070 

. ( 5)227 9607 543 

19 0 

.( 1)577 5569 190 

-.( 3)253 5368 389 

( 5)200 1606 623 

19 5 

.( 1)563 0874 060 

-,( 3)234 9624 060 

. ( 5) 176 3320 120 

20.0 

.( 1)549 3258 868 

- ( 3)218 1596 056 

. ( 5) 155 8282 897 

20.5 

.( 1)536 2215 909 

- ( 3)202 9220 779 

.( 5)138 1205 295 

21.0 

.( 1)523 7284 931 

-.( 3)189 0716 582 

.( 5)122 7738 040 

21.5 

.( 1)511 8047 713 

-.( 3)176 4539 808 

.( 5)109 4288 253 

22,0 

.( 1)500 4123 371 

-.( 3)164 9348 507 

.( 6)977 8746 091 

22 5 

.( 1)489 5164 279 

-,( 3)164 3972 332 

, ( 6)876 0126 706 

23 0 

.( 1)479 0852 511 

-.( 3)144 7387 466 

,( 6)785 2017 048 

23,5 

( 1)469 0896 739 

-.( 3)135 8695 652 

.( 6)707 9612 604 

24.0 

. ( 1)459 5029 501 

-.( 3)127 7106 587 

.( 6)638 5532 937 

24.5 

.( 1)450 3004 808 

-,( 3)120 1923 077 

.( 6)577 1539 385 

25.0 

.( 1)441 4596 027 

- ( 3)113 2528 483 

. ( 6)522 7054 537 
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POL YNOMIAL APPROXIMATION 


TABLE II — (Continued) 



A 

23 

C 

50.5 

.( 1)220 6235 860 

— . ( 4)141 4027 149 

.( 7)163 1099 278 

51.0 

.( 1)218 4809 327 

-.( 4)137 3230 250 

.( 7)155 3427 884 

51.5 

.( 1)216 3795 035 

-.( 4)133 3987 877 

. ( 7)148 0152 984 

52.0 

.( 1)214 3181 200 

( 4)129 6226 684 

.( 7)141 0986 957 

52.5 

.( 1)212 2956 479 

-.( 4)125 9877 439 

.( 7)134 5663 486 

53.0 

.( 1)210 3109 957 

-,( 4)122 4874 757 

( 7)128 3935 804 

53.5 

.( 1)208 3631 123 

-.( 4)119 1156 852 

.( 7)122 5575 085 

54.0 

.( 1)206 4509 850 

-,( 4)115 8665 310 

.( 7)117 0369 000 

54 5 

.( 1)204 5736 382 

-. ( 4)112 7344 877 

.( 7)111 8120 384 

55.0 

.( 1)202 7301 313 

-.( 4)109 7143 258 

.( 7)106 8646 031 

55.5 

.( 1)200 9195 574 

-.( 4)106 8010 936 

.( 7)102 1775 591 

56 0 

.( 1)199 1410 418 

( 4)103 9901 001 

.( 8)977 3505 652 

56.5 

.( 1)197 3937 403 

-.( 4)101 2768 991 

. ( 8)935 2233 857 

57,0 

.( 1)195 6768 382 

(51986 5727 449 

( 8)895 2565 744 

57.5 

( 1)193 9895 490 

-.( 5)961 2722 631 

.( 8)857 3219 737 

58 0 

.( 1)192 3311 130 

-.( 5)936 8295 813 

.( 8)821 3000 421 

58.5 

.( 1)190 7007 963 

- ( 5)913 2086 499 

( 8)787 0792 070 

59.0 

.( 1)189 0978 896 

-.( 5)890 3752 219 

.( 8)754 5552 728 

59.5 

.( 1)187 5217 074 

-.( 5)868 2967 491 

.( 8)723 6308 764 

60.0 

.( 1)185 9715 868 

-. ( 5)846 9422 843 

.( 8)694 2149 871 

60.5 

.( 1)184 4468 866 

- ( 5)826 2823 903 

.( 8)666 2224 473 

61.0 

.( 1)182 9469 865 

-.( 5)806 2890 546 

.( 8)639 5735 494 

61.5 

.( 1)181 4712 863 

-.( 5)786 9356 098 

.( 8)614 1936 467 

62.0 j 

.( 1)1800192049 

-. ( 5)768 1966 583 

.( 8)590 0127 944 

62.5 

.( 1)178 5901 798 

-.( 5)750 0480 031 

.( 8)566 9654 196 

63,0 

.( 1)177 1836 660 

-.( 5)732 4665 812 

.( 8)544 9900 158 

63 5 

.( 1)175 7991 358 

-.( 5)715 4304 029 

.( 8)524 0288 613 

64.0 

.( 1)174 4360 776 

-.( 5)698 9184 935 

( 8)504 0277 597 

64 5 

.( 1)173 0939 958 

-.( 5)682 9108 392 

( 8)484 9357 992 

65.0 

.( 1)171 7724 099 

-.( 5)667 3883 359 

. ( 8)466 7051 300 

65.5 

.( 1)170 4708 538 

-.( 5)652 3327 419 

.( 8)449 2907 595 

66,0 

.( 1)169 1888 755 

-.( 5)637 7266 321 

. ( 8)432 6503 610 

66.5 

.( 1)167 9260 366 

-.( 5)623 5533 562 

.( 8)416 7440 977 

67.0 

.( 1)166 6819 116 

-.( 5)609 7969 986 

.( 8)401 5344 591 

67.5 

.( 1)165 4560 875 

-.( 5)596 4423 407 

. ( 8)386 9861 091 

68,0 

.( 1)164 2481 635 

-.( 5)583 4748 260 

. ( 8)373 0657 455 

68,5 

( 1)163 0577 503 

-.( 5)570 8805 261 

.( 8)359 7419 689 

69 0 

" ( 1)161 8844 697 

-.( 5)558 6461 100 

.( 8)346 9851 615 

69.5 

.( 1)160 7279 547 

-.( 5)546 7588 138 

.( 8)334 7673 741 

70.0 

.( 1)159 5878 482 

- . ( 5)535 2064 131 

.( 8)323 0622 212 

70.5 

.( 1)158 4638 037 

-.( 5)523 9771 965 

.( 8)311 8447 829 

71.0 

.( 1)157 3554 838 

-.( 5)513 0599 408 

.( 8)301 0915 145 

71.5 

.( 1)156 2625 611 

-.( 5)502 4438 871 

. ( 8)290 7801 613 

72 0 

.( 1)155 1847 168 

-.( 5)492 1187 187 

.( 8)280 8896 796 

72 5 

.( 1)154 1216 409 

-.( 5)482 0745 403 

.( 8)271 4001 634 

73.0 

.( 1)153 0730 320 

-.( 5)472 3018 576 

. ( 8)262 2927 754 

73 5 

.( 1)152 0385 968 

-.( 5)462 7915 587 

. ( 8)253 5496 829 

74.0 

.( 1)151 0180 498 

-.( 5)453 5348 963 

.( 8)245 1539 980 

74.‘5 

.( 1)150 0111 131 

- ( 5)444 5234 708 

.( 8)237 0897 218 

75,0 

,( 1)149 0175 162 

-,( 5)435 7492 141 

.( 8)229 3416 916 
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TABLE II — (Continued) 

n 3 


C 


75.5 

76.0 

76.5 

77.0 

77.5 

78.0 

78.5 

79.0 

79.5 
80 0 


.( 1)148 0369 959 
.( 1)147 0692 956 
.( 1)146 1141 654 
.( 1)145 1713 622 
.( 1)144 2406 486 
.( 1)143 3217 937 
.( 1)142 4145 722 
.( 1)141 5187 645 
.( 1)140 6341 567 
.( 1)139 7605 399 


( 5)427 2043 746 
-.( 5)418 8815 026 
-.( 5)410 7734 371 
-.( 5)402 8732 923 
-.( 5)395 1744 458 
-.( 5)387 6705 266 
-.( 5)380 3554 041 
-.( 5)373 2231 778 
-.( 5)366 2681 669 
-.( 5)359 4849 013 


. ( 8)221 8955 328 
.( 8)214 7376 124 
. ( 8)207 8549 966 
.( 8)201 2354 107 
.( 8)194 8672 015 
.( 8)188 7393 021 
,( 8)182 8411 989 
.( 8)177 1629 008 
.( 8)171 6949 101 
( 8)166 4281 950 


80.5 
81.0 

81.5 
82.0 

82.5 

83.0 

83.5 

84.0 

84.5 

85.0 


.( 1)138 8977 106 
.( 1)138 0454 701 
.( 1)137 2036 248 
.( 1)136 3719 855 
.( 1)135 5503 678 
.( 1)134 7385 917 
.( 1)133 9364 812 
.( 1)133 1438 649 
.( 1)132 3605 750 
.( 1)131 5864 481 


-.( 5)352 8681 120 
-.( 5)346 4127 230 
-.( 5)340 1138 429 
-.( 5)333 9667 569 
-.( 5)327 9669 199 
- . ( 5)322 1099 490 
-.( 5)316 3916 169 
-.( 5)310 8078 455 
-.( 5)305 3547 000 
-.( 5)300 0283 827 


. ( 8) 161 3541 647 
.( 8)156 4646 446 
. ( 8)151 7518 541 
.( 8)147 2083 854 
. ( 8) 142 8271 834 
. ( 8)138 6015 271 
.( 8)134 5250 116 
.( 8)130 5915 317 
.( 8)126 7952 663 
.( 8)123 1306 632 


85.5 
86.0 

86.5 

87.0 

87.5 

88.0 

88.5 

89.0 

89.5 

90.0 


.( 1)130 8213 241 
.( 1)130 0650 470 
.( 1)129 3174 642 
.( 1)128 5784 266 
.( 1)127 8477 885 
.( 1)127 1254 075 
.( 1)126 4111 445 
.( 1)125 7048 632 
.( 1)125 0064 308 
.( 1)124 3157 171 


-.( 5)294 8252 276 
-.( 5)289 7416 953 

- . ( 5)284 7743 676 

- . ( 5)279 9199 429 
-.( 5)275 1752 316 
-.( 5)270 5371 515 
-.( 5)266 0027 239 
-.( 5)261 5690 691 
-.( 5)257 2334 033 
-.( 5)252 9930 341 


.( 8)119 5924 258 
.( 8)116 1754 993 
.( 8)112 8750 590 
. ( 8)109 6864 980 
.( 8)106 6054 166 
.( 8)103 6276 117 
. ( 8)100 7490 669 
. ( 9)979 6594 350 
. ( 9)952 7457 143 
. ( 9)926 7144 106 


90.5 

91.0 

91.5 

92.0 

92.5 

93.0 

93.5 

94.0 

94.5 

95.0 


.( 1)123 6325 948 
.( 1)122 9569 394 
.( 1)122 2886 291 
.( 1)121 6275 450 
.( 1)120 9735 702 
.( 1)120 3265 909 
.( 1)119 6864 953 
.( 1)119 0531 742 
.( 1)118 4265 205 
.( 1)117 8064 296 


-.( 5)248 8453 575 
-.( 5)244 7878 546 
-. ( 5)240 8180 879 
-.( 5)236 9336 988 
-.( 5)233 1324 043 
-. ( 5)229 4119 941 
-.( 5)225 7703 284 
-.( 5)222 2053 346 
-.( 5)218 7150 056 
-.( 5)215 2973 968 


.( 9)901 5319 538 
. ( 9)877 1662 253 
.( 9)853 5864 881 
.( 9)830 7633 199 
.( 9)808 6685 508 
.( 9)787 2752 029 
.( 9)766 5574 344 
.( 9)746 4904 858 
.( 9)727 0506 294 
.( 9)708 2151 209 


95.0 

96.0 

96.5 

97.0 

97.5 

98.0 

98.5 

99.0 

99.5 

100.0 


.( 1)117 1927 988 
.( 1)116 5855 277 
1)115 9845 180 
1)115 3896 733 
.( 1)114 8008 993 
.( 1)114 2181 034 
.( 1)113 6411 951 
.( 1)113 0700 856 
.( 1)112 5046 880 
■( 1)111 9449 168 


-.( 5)211 9506 240 
-.( 5)208 6728 615 
-.( 5)205 4623 396 , 
-.( 5)202 3173 428 
-.( 5)199 2362 08 l 
-.( 5)196 2173 225 
-.( 5)193 2591 218 
-.( 5)190 3600 889 
-.( 5)187 5187 519 
-.( 5)184 7336 824 


. ( 9)689 9621 539 
. ( 9)672 2708 166 
. ( 9)655 1210 509 
.( 9)638 4936 130 
.( 9)622 3700 369 
. ( 9)606 7325 988 
.( 9)591 5642 838 
. ( 9)576 8487 544 
.( 9)562 5703 199 
■( 9)548 7139 081 
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POLYNOMIAL APPROXIMATION 


TABLE III 

(The numbets in paicntheses denote the number of ciphers between the 
decim al point and the first significant figure.) 


2.534 7222 222 
,902 7777 778 
.450 0999 559 
,262 5061 376 
.167 8541 366 
.114 3378 227 
.(1)816 0531 598 
. (1)603 7943 538 

.(1)459 7794 181 
.(1)358 4609 835 
.(1)285 0269 624 
.(1)230 4589 927 
.(1)189 0399 941 
.(1)157 0204 105 
.(1)131 8685 978 
.(1)111 8316 811 
.(2)956 6897 397 
.(2)824 8507 009 


.(2)716 ; 
.(2)625 < 
.(2)550 1 
.(2)486 : 
.(2)431 I 
.(2)385 ; 
.(2)345 
.(2)310 - 
.(2)280 ; 
.(2)253 

.(2)230 

.( 2)210 

.(2)192 

.(2)176 

.(2)161 

(2) 148 
,(2)137 
.(2)127 
* (2)117 
.(2)109 

.( 2)101 

(3) 947 
,(3)884 
.(3)826 
.(3)773 
.(3)725 
,(3)680 
.(3)639 
.(3)601 
.(3)567 


2246 443 
9079 085 
1917 791 
2354 500 
8375 890 
2742 263 
1820 327 
4731 565 
2723 203 
8698 342 

6861 266 
2447 093 
1513 833 
0781 076 
7503 875 
9373 393 
4438 092 
1040 798 
7768 137 
3409 664 

6924 641 
4149 003 
1025 513 
3115 900 
4526 546 
0103 342 
) 5325 601 
l 6217 018 
. 9270 658 
' 1385 543 


- 1.138 i 
-.236 1 

,(1)779 < 
.(1)324 C 
.(1)155 / 
.(2)827 ; 
.(2)474 2 
. (2)288 1 

.(2)183 - 
.( 2)121 - 
.(3)829 i 
,(3)583 I 
,(3)419 ! 
.(3)308 
.(3)230 , 
.(3)175 ; 
,(3)135 
.(3)105 - 

.(4)835 ■ 

. (4)607 . 

.(4)538 

.(4)438 

.(4)359 

.(4)297 

,(4)247 

.(4)207 

.(4)175 

.(4)148 

.(4)126 
(4)108 
.(5)933 
.(5)807 
.(5)700 
,(5)610 
■.(5)534 
. (5)469 
, (5)413 
.(5)365 

■ ,(5)323 

■ .(5)287 
•.(5)256 
• .(5)228 
-.(5)205 
-.(5)184 
-.(5)165 
-.(5)149 
-.(5)135 

- .(5)122 


8888 889 

mi -in 

3209 877 
0740 741 
7239 057 
7216 611 
2942 243 
1377 881 


.555 5555 556 
.(1)694 4444 444 
.(1)154 3209 877 
. (2)462 9629 630 
.(2)168 3501 684 
.(3)701 4590 348 
,(3)323 7503 238 
.(3)161 8751 619 


4585 167 
4063 714 
8482 563 
0679 850 
5222 848 
1642 014 
5259 336 
2561 737 
1741 492 
6201 476 

0091 302 
2071 889 
3326 639 
2359 455 
6848 299 
4533 626 
7164 138 
6407 573 
1046 701 
5029 580 

6096 151 
4799 077 
7977 791 
3440 736 
9036 937 
8752 239 
3795 459 
1008 114 
1653 981 
0491 008 

5054 757 
5101 521 
2172 813 
9252 750 

■ 0496 990 

■ 1017 105 
! 6708 183 
1 4110 299 
i 0296 678 
! 2783 008 


.(4)863 ; 
.(4)485 i 
(4)285 i 
,(4)174 , 
.(4)110 : 
.(5)716 
.(5)477 
.(5)325 
.(5)226 
.(5)160 

.(5)115 
.(6)844 
.(6)625 
.(6)469 
. (6)35S 
.(6)272 
.( 6)211 
.(6)165 
.(6)130 
.(6)103 

.(7)826 

,(7)665 

.(7)539 

.(7)440 

.(7)361 

.(7)297 

.(7)247 

.(7)205 

.(7)172 

(7)144 

.( 7)122 

(7) 103 

(8) 883 
.(8)754 
.(8)646 
.(8)556 

(8)479 

.(8)415 

,(8)360 

.(8)313 


3341 967 
6254 856 
6620 504 
5712 530 
2555 282 
6609 334 
7739 556 
7549 697 
6121 528 
5169 416 

5721 980 
5660 620 
6044 903 
2033 678 
9473 824 
8929 932 
2719 947 
0562 459 
0443 149 
2704 854 

1638 832 
5209 059 
6115 453 
2094 185 
1974 716 
9879 141 
1119 288 
9266 073 
4036 712 
9758 144 

4240 211 
7942 787 
3555 638 
5328 774 
7424 663 
1985 210 
8575 476 
2613 392 
4155 020 
6949 739 


//. r. DAVIS 


181 


T AbLIL III— (Continue d) 




S' 


C' 


25 5 
2 <> 0 

26 5 

27 0 

27.5 
28.0 

28.5 
29 0 

29.5 
30.0 


(3)534 9812 832 
.(3)505 2110 028 
(3)477 6101 850 
.(3)451 9846 731 
.(3)428 1608 021 
.(3)405 9829 189 
.(3)385 3112 389 
.(3)366 0199 892 
.(3)347 9957 969 
.(3)331 1362 854 


- .(5)110 9453 570 
-.(5)100 8500 824 
>-.(6)918 3758 072 
-.(6)837 7472 451 
-.(6)765 4677 208 
-.(6)700 5455 924 
-.(6)642 1215 945 
-.(6)589 4492 824 
-.(6)541 8786 342 
-.(6)498 8422 596 


.(8)273 7701 591 
. (8)239 5488 892 
.(8)210 1306 046 
. (8) 184 7701) 144 
. (8) 162 mi 482 
.(8)143 8491 976 
.(8)127 3419 126 
,(8)112 9645 999 
. (8) 100 4129 777 
. (9)894 3030 827 


30.5 

31.0 

31.5 

32.0 

32.5 

33.0 

33.5 

34.0 

34.5 

35.0 


.(3)315 3488 491 
.(3)300 5495 828 
.(3)286 6623 418 
.(3)273 6179 176 
.(3)261 3533 111 
.(3)249 8110 923 
.(3)238 9388 342 
.(3)228 6886 114 
.(3)219 0165 553 
.(3)209 8824 591 


.(6)459 
.(6)424 
,(6)392 
.(6)362 
.(6)336 
.(6)311 
.(6)289 
.(6)269 
. (6)250 
. (6)233 


8437 659 
4479 170 
2722 841 
9801 450 
2744 286 
8925 371 
6019 105 
1962 143 
4920 591 
3261 690 


.(9)797 9935 200 
.(9)713 3578 436 
.(9)638 8279 197 
.(9)573 0662 221 
.(9)514 9290 691 
.(9)463 4361 622 
.(9)417 7452 730 
.(9)377 1311 492 
.(9)340 9678 883 
.(9)308 7141 691 


35.5 

36.0 

36.5 

37.0 

37.5 
38 0 
38 5 

39.0 

39.5 

40.0 


.(3)201 2494 260 
.(3)193 0835 554 
.(3)185 3536 630 
.(3)178 0310 300 
.(3)171 0891 785 
.(3)164 5036 705 
.(3)158 2519 263 
.(3)152 3130 623 
.(3)146 6677 431 
.(3)141 2980 508 


-.(6)217 5529 331 
-.(6)203 0422 839 
-.(6)189 6778 555 
-.(6)177 3553 804 
-.(6)165 9829 799 
-.(6)155 4715 079 
-. (6; 145 7503 555 
-.(6)136 7496 434 
-.(6)128 4078 366 
-.(6)120 6693 349 


.(9)279 9008 467 
.(9)254 1205 055 
,(9)231 0186 414 
.(9)210 2861 992 
.(9)191 6532 449 
,(9)174 8835 859 
.(9)159 7701 89/5 
.(9)146 1312 710 
.(9)133 8069 469 
,(9)122 6563 680 


40.5 

41.0 

41.5 

42.0 

42.5 

43.0 

43.5 
44 0 

44.5 

45.0 


.(3)136 1873 638 
.(3)131 3202 493 
.(3)126 6823 653 
.(3)122 2603 712 
.(3)118 0418 479 
.(3)114 0152 237 
.(3)110 1697 079 
.(3)106 4952 300 
. (30102 9823 841 
.(4)996 2237 840 


-.(6)113 4838 362 
-.(6)106 8057 759 
(6) 100 5938 300 
-.(7)948 1047 667 
-.(7)894 2160 708 
-.(7)843 9617 986 
-.(7)797 0591 436 
-.(7)753 2501 737 
-.(7)712 2993 971 
-.(7)673 9915 889 


.(9)112 5552 554 
.(9)103 3937 811 
.(10)950 7474 123 
.(10)875 1197 773 
.(10)806 2901 319 
.(10)743 5786 772 
.(10)686 3803 174 
.(10)634 1557 280 
.(10)586 4235 765 
.(10)542 7537 357 


45.5 

46.0 

46. 5 

47.0 

47.5 

48.0 

48.5 

49.0 

49.5 

50.0 


.(4)964 0698 884 
.(4)933 2851 680 
.(4)903 7975 033 
.(4)875 5392 866 
.(4)848 4470 959 
.(4)822 4613 955 
.(4)797 5262 609 
.(4)773 5891 262 
.(4)750 6005 505 
(4)728 5140 038 


-.(7)638 1298 500 
-.(7)604 5338 699 
-.(7)573 0383 707 
-.(7)543 4917 120 
-,(7)515 7J46 374 
-.(7)489 6991 482 
-.(7)465 2074 902 
- (7)442 1712 397 
-. (7)420 4904 806 
-.(7)400 0730 601 


, (10)502 7613 551 
.(10)466 1016 730 
.(10)432 4654 698 
.(10)401 5750 791 
,(10)373 1808 816 
,(10)347 0582 199 
.(10)323 0046 799 
,(10)300 8376 920 
.(10)280 3924 120 
. (10)261 5198 458 
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POLYNOMIAL APPROXIMATION 


TABLE IV 


(The numbers in parentheses denote the number of ciphers between tiie 
decimal point and the first significant figure ) 


/O 

A' 

<5 

c 

2,0 

1.000 0000 000 

- 1 250 0000 000 

250 0000 000 

2.5 

.70S 9936 52 3 

495 6054 688 

.(1 )615 2343 750 

3.0 

.567 0995 671 

,265 1515 152 

(1)227 2727 273 

3.5 

.479 4006 348 

162 3535 156 

.(1)102 5390 625 

4.0 

.417 2494 172 

.107 8088 578 

.(2)524 4755 245 

4,5 

.370 3002 930 

-.(1)756 8359 375 

.(2)292 9687 500 

5.0 

.333 3333 333 

(1)553 6130 536 

. (2)174 8251 748 

5 5 

.303 3523 560 

-.(1)418 0908 203 

.(2)109 8632 813 

6.0 

.278 4862 197 

- . (1)323 9407 651 

.(3)719 8683 669 

6.5 

.257 4942 453 

-.(1)256 3476 563 

.(3)488 2812 500 

7.0 

.239 5159 021 

- (1)206 4885 579 

. (3)340 9902 791 

7.5 

.223 9329 020 

(1)168 8639 323 

. (3)244 1406 250 

8.0 

.210 2881 638 

-.(1)139 9142 653 

.(3)178 6139 557 

8.5 

.198 2357 141 

-.(1)117 2614 820 

(3)133 1676 136 

9,0 

.187 5084 130 

-.(2)992 7311 886 

.(3)100 9557 141 

9.5 

.177 8964 418 

(2)848 0187 618 

(4)776 8110 795 

10.0 

169 2325 443 

-.(2)730 2463 319 

. (4)605 7342 846 

10.5 

.161 3816 481 i 

- (2)633 3998 033 

. (41478 0375 874 

11.0 

.154 2334 096 

-.(2)553 0129 672 

.(4)381 3882 532 

11.5 

.147 6967 551 

-.(2)485 7203 344 

. (4)307 3098 776 

12.0 

.141 6958 188 

-.(2)428 9521 906 

(4)249 8750 625 

12.5 

.136 1668 726 

-. (2)380 7227 928 

.(4)204 8732 517 

13 0 

.131 0559 774 

-.(2)339 4807 972 

.(4)169 2702 036 

13.5 

.126 3171 605 

- . (2)304 0020 282 

.(4)140 8503 606 

14 0 

.121 9109 898 

- . (2)273 3115 358 

.(4)117 9762 025 

14 5 

.117 8034 442 

- (2)246 6 262 196 

.(5)994 2378 394 

15.0 

.113 9650 116 

-.(2)223 3120 976 

. (5)842 6871 608 

15.5 

110 3699 628 

-.(2)202 8521 369 

.(5)718 0606 618 

16.0 

.106 9957 611 

- (2)184 8217 922 

.(5 ) 614 9338 741 

16.5 

.103 8225 802 

-.(2)168 8702 311 

.(5)529 0973 297 

17 0 

.100 8329 073 

-.(2)154 7057 999 

.(5)457 2585 218 

17.5 

.(1)980 1121 368 

-.(2)142 0846 788 

.(5)396 8229 973 

18,0 

,(1)953 4368 071 

— . (2) 130 8019 601 

.(5)345 7320 530 

18.5 

.(1)928 1796 986 

- (2)120 6845 814 

.(5)302 3413 313 

19.0 

.(1)904 2302 916 

-.(2)111 5856 901 

(5)265 3292 500 

19.5 

.(1)881 4892 929 

-. (2)103 3801 211 

.(5)233 6273 923 

20.0 

,(1)859 8672 410 

-.(3)959 6074 542 

.(5)206 3671 945 

20.5 

.(1)839 2833 163 

-.(3)892 3550 616 

.(5)182 8388 288 

21 0 

.(1)819 6643 189 

-.(3)831 2499 865 

(5)162 4592 807 

21.5 

.(1)800 9437 893 

-.(3)775 6048 512 

. (5)144 7474 061 

22,0 

,(1)783 0612 483 

- (3)724 8225 801 

.(5)129 3043 255 

22.5 

.(1)765 9615 377 

-.(3)678 3828 434 

.(5)115 7979 249 

23.0 

.(1)749 5942 464 

-,(3)635 8307 796 

.(5)103 9505 362 

23 5 

.(1)733 9132 094 

-.(3)596 7675 752 

. (6)935 2909 319 

24.0 

.(1)718 8760 691 

-.(3)560 8425 626 

< (6)843 3722 746 

24.5 

.(1)704 4438 900 

-.(3)527 7465 684 

.(6)762 0889 075 

25.0 

.(1)690 5808 198 

-.(3)497 2062 910 

.(6)690 0318 611 
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TABLE IV- 



-(Continued) 


^5 


.538 : 
.(1)824 ( 
.( 1)211 < 
.(2)706 2 
.(2)279 2 
.(2)124 - 
.(3)607 ( 

.(3)317 ! 
.(3)176 . 
.(3)102 i 
.(4)622 
. (4)390 
. (4)252 
. (4X167 
. (4)113 
. (5)788 
. (5)556 

. (5)399 
,(5)291 
.(5)215 
■.(5)161 
.(5)122 
.(6)936 
.(6)723 
. (6)564 
.(6)444 
. (6)352 

.(6)282 
, (6)227 
■. (6)183 
.(6)150 
•.(6)123 
-.( 6)101 
-.(7)840 
•.(7)700 
■,( 7)586 
'.(7)492 

•.(7)415 
(7)352 
-.(7)299 
-.(7)256 
.. (7)219 
(7)188 
-.(7)162 
*.(7)140 
(7)122 
-.(7)106 


1944 444 
6527 778 
4898 990 
2815 657 
2346 542 
4415 307 
0318 570 


9329 351 
3963 161 
6597 994 
0667 018 
2012 053 
2095 208 
3566 157 
6601 579 
0526 136 
6161 004 

7797 905 
5234 609 
5402 971 
3897 791 
2557 913 
0842 049 
8747 984 
9425 143 
6919 780 
8408 251 

0575 132 
0555 132 
9826 864 
0043 633 
0152 522 
4381 253 
8165 075 
3899 143 
1479 930 
7218 558 

9387 633 
5353 142 
9440 718 
1336 780 
4882 245 
7156 473 
: 7777 261 
8362 750 
: 2115 427 
i 3498 773 


.121 5 
.(1)121 5 
.( 2)220 £ 
.(3)552 3 
.(3)169 S 
.(4)607 ( 
.(4)242 E 

.(4)106 ; 

.(5)499 

.(5)249 

.(5)131 

.(6)723 

.(6)413 

.(6)244 

.(6)148 

.(7)929 

.(7)594 

.(7)388 

.(7)259 

.(7)175 

.(7)121 

.(8)849 

.(8)602 

(8)433 

.(8)315 

.(8)231 

.(8)172 

.(8)129 

.(9)976 

.(9)745 

.(9)573 

.(9)444 

.(9)346 

.(9)272 

.(9)215 

.(9)171 

.(9)137 

.(9)110 

,(10)891 

.(10)724 

,(10)591 

.(10)485 

.(10)399 

.(10)330 

.(10)274 

,(10)228 

.(10)191 


5277 778 
5277 778 
9595 960 
3989 899 
9689 200 
0318 570 
8127 428 

2305 750 
9085 881 
9542 941 
5548 916 
5519 039 
4582 308 
3162 273 
7142 253 
4639 082 
8569 012 

9448 970 
2965 980 
9512 629 
3456 986 
4198 899 
8141 154 
2726 455 
1073 785 
6966 018 
1174 757 

0881 067 
8829 699 
i 5159 507 
i 4738 083 
t 4422 014 
i 8817 182 
! 5499 214 
! 5045 890 
4241 049 
' 1392 839 

► 3076 849 
. 8493 673 
\ 6276 109 
. 5327 436 
5 0568 497 
) 4585 821 
) 3215 199 
\ 2291 863 
i 5243 219 
l 1294 329 
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Pol. YNOMlAl APPROXIMA TION 


TABLE V 


(The numbers in parentheses denote the number of ciphers between the 
decimal point and the first significant figure ) 


2 5 2.755 1030 816 - 1.605 6163 194 

30 1 170 5555 556 - 456 9444 444 

3.5 .658 2671 3-7 - .181 6553 455 

4.0 . 418 6208 236 -.( 1)868 9782 440 

4 5 .286 4246 324 -.( 1)466 0481 012 

5.0 .206 0275 835 - . ( 1)270 7119 270 

5.5 .153 7850 411 -.( 1)166 9293 642 

6.0 .118 1431 967 -.( 1)107 8702 752 

6 .5 .( 1)928 9213 334 -.( 2)724 0120 171 

7.0 .( 1)744 5226 377 -.( 2)501 4851 420 

7.5 .( 1)606 4555 217 -.( 2)356 7180 657 

8 0 .( 1)500 8805 006 -.( 2)259 6021 548 

8.5 .( 1)418 6842 624 -.( 2)192 7113 758 

9.0 .( 1)353 6828 101 -.( 2)145 5695 568 

9.5 .( 1)301 5707 729 -.( 2)111 6692 311 

10 0 .( 1)259 2832 573 -.( 3)868 5125 822 

10.5 .( 1)224 5954 598 -. ( 3)683 8979 612 

11 0 . ( 1)195 8642 756 -. ( 3)544 5806 418 

11 5 .( 1)171 8573 720 -.( 3)438 0711 370 

12 0 .( 1)151 6376 163 - ( 3)355 6755 034 

12.5 .( 1)134 4833 894 -. ( 3)291 2423 953 

13.0 . ( 1)119 8326 681 -. ( 3)240 3543 455 

13 5 .( 1)107 2431 572 -.( 3)199 7956 319 

14.0 ( 2)963 6345 096 -. ( 3)167 1959 380 

14.5 .( 2)869 1189 432 - ( 3)140 7878 615 

15 0 .( 2)786 6089 931 - ( 3)119 2394 389 

15.5 . ( 2)714 2517 761 -. ( 3)101 5368 865 

16 0 .( 2)650 5281 970 -. ( 4)869 0145 535 

16.5 .( 2)594 1846 661 -. ( 4)747 2977 719 

17 0 .( 2)544 1803 654 -. ( 4)645 5059 265 

17.5 .( 2)499 6461 807 - . ( 4)559 9303 930 

18.0 .( 2)459 8524 699 -.( 4)487 6318 743 

18.5 ( 2)424 1835 784 - ( 4)426 2652 106 

19.0 .( 2)392 1175 513 -.( 4)373 9474 673 

19.5 .( 2)363 2098 774 -.( 4)329 1578 113 

20.0 .( 2)337 0803 860 -. ( 4)290 6609 405 

20 5 .( 2)313 4026 161 -.( 4)257 4481 065 

21.0 ( 2)291 8951 572 -, ( 4)228 6913 803 

21.5 .( 2)272 2750 483 - ( 4)203 7079 595 

22.0 .( 2)254 4494 241 - ( 4)181 9321 409 

22.5 .( 2)238 1156 895 -. ( 4)162 8931 866 

23 0 .( 2)223 1524 850 -.( 4)146 1977 446 

23 5 ( 2)209 4188 368 -.( 4)131 5158 166 

24 0 .( 2)106 7908 418 -.( 4)118 5694 971 

24 5 . ( 3)186 2576 776 -. ( 4)107 7593 614 

25.0 1 .( 2)174 4277 221 -. ( 5)969 7978 581 


.200 8159 722 
.( 1)363 8888 889 
( 1)104 8944 979 
.( 2)382 4786 325 
.( 2)162 0459 402 
.( 3)763 8888 889 

. ( 3)390 4384 270 
.( 3)212 7325 289 
.( 3)122 1004 174 
.( 4)731 8541 452 

( 4 ) 455 0831 382 
.( 4)292 0668 942 
.( 4)192 6724 344 
.( 4)130 2141 757 
. ( 5)899 1006 061 
. ( 5)632 8140 010 

.( 5)453 1298 476 
. ( 5)329 5585 675 

( 5 ) 243 1030 464 
. ( 5)181 6613 061 
, ( 5)137 3671 277 
.( 5)105 0128 024 
. ( 6)810 9219 640 
. ( 6)632 0799 809 
. ( 6)496 9749 385 
.( 6)393 9212 946 

.( 6)314 6050 442 
. ( 6)253 0429 506 
. ( 6)204 8829 224 
. ( 6)166 9275 426 
. ( 6)136 8055 923 
. ( 6)112 7430 005 
( 7)934 0140 025 
.( 7)777 6318 298 
. ( 7)650 4887 492 
( 7)546 5721 085 

. ( 7)461 2129 835 
. ( 7)390 7629 285 
.( 7)332 3537 754 
.( 7)283 7178 436 
, ( 7)243 0524 324 
( 7)208 9170 140 
.( 7)180 1547 432 
.( 7)155 8321 715 
.( 7)135 9946 487 
.( 7)117 6202 932 
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TABLE V — {Continued) 


2.5 

3.0 

3.5 

4.0 

4 5 

5.0 

5 5 

6.0 

6.5 
7 0 

7.5 
8.0 

8.5 
9 0 

9.5 
10.0 

10.5 
11.0 

11.5 
12.0 

12 5 

13.0 

13 5 

14.0 

14 5 

15 0 

15.5 

16 0 

16.5 

17 0 

17.5 

18 0 

18.5 

19 0 

19.5 

20 0 

20 5 

21.0 

21 5 

22 0 

22.5 

23.0 

23.5 

24.0 

24.5 

25.0 


1.151 0416 667 - .140 

.203 1250 000 -.( 1)170 

.( 1)579 2905 012 -.( 2)355 

.( 1)210 1544 289 -.( 3)988 

. ( 2)887 9662 005 - ( 3)331 

.( 2)417 9414 336 -.( 3)126 

.(2 ) 213 4163 324 -.( 4 ) 538 

.( 2)116 2108 929 -.( 4)247 

( 3 ) 666 7389 843 -.( 4)121 

.( 3)399 5246 884 -.( 5)630 

.( 3)248 3850 322 -.( 5)343 

.( 3)159 3881 480 -.( 5)194 

.( 3)105 1352 375 -.( 5)113 

.( 4)710 4822 114 -.( 6)689 

.( 4)490 5434 324 -.( 6)428 

.( 4)345 2433 425 -.( 6)273 

.( 4)247 2044 428 -.( 6)177 

( 4 ) 179 7851 483 -.( 6)118 

.( 4)132 6177 560 -.( 7)799 

. ( 5)990 9817*818 -.( 7)549 

.( 5)749 3410 036 -.( 7)384 

.( 5)572 8402 600 -.( 7)272 

.( 5)442 3498 419 -.( 7)195 

.( 5)344 7903 612 -.( 7)141 

.( 5)271 0905 849 -.( 7)104 

.( 5)214 8754 286 -.( 8)771 

.( 5)171 6092 639 -.( 8)578 

.( 5)138 0280 549 -.( 8)437 

.( 5)111 7576 435 -.( 8)333 

.( 6)910 5379 546 -.( 8)256 

.( 6)746 2299 456 -.( 8)198 

.( 6)614 9748 289 -.( 8)154 

.( 6)509 4718 636 -.( 8)121 

.( 6)424 1700 765 -.( 9)959 

( 6)354 8175 306 -.( 9)762 

.( 6)298 1344 148 -.( 9)609 

.( 6)251 5739 152 -.( 9)490 

.( 6)213 1458 799 -.( 9)396 

.( 6)181 2857 886 -.( 9)321 

.( 6)154 7566 781 -.( 9)262 

.( 6)132 5752 232 -.( 9)215 

.( 6)113 9556 568 -.( 9)177 

.( 7)982 6695 408 -.( 9)146 

. ( 7)849 9994 800 -.( 9)121 

.( 7)741 7936 742 -.( 9)101 

.( 7)641 5689 752 - r ( 10)846 


9722 222 ,( 1)175 0000 000 
1388 889 .( 2)145 8333 333 
2350 427 .( 3)224 3589 744 
2478 632 .( 4)480 7692 308 
1965 812 .( 4)128 2051 282 
8696 582 .( 5)400 6410 256 

1158 874 .( 5)141 4027 149 
4547 511 . ( 6) 549 8994 470 
5567 199 .( 6)231 5366 092 
9372 602 .( 6)104 1914 742 
1703 316 .( 7)496 1498 770 
3253 685 .(7 ) 248 0749 385 
9706 601 .( 7)129 4304 027 
3966 587 .( 8)701 0813 479 
5943 973 .( 8)392 6055 548 
0621 968 .( 8)226 5032 047 

8469 607 . ( 8 ) 134 2241 213 
1651 639 .( 9)814 9321 650 
4765 550 .( 9)505 8199 645 
9387 0 S 9 .( 9)320 3526 442 
0787 078 . ( 9)206 6791 253 
0198 696 .( 9)135 6331 760 
1610 699 .( 10)904 2211 731 
7056 417 .( 10)611 6790 289 
0436 901 .( 10)419 4370 484 
8806793 .( 10)291 2757 280 

2216 817 .( 10)204 6802 413 
1000 417 .( 10)145 4306 978 
2476 075 .( 10)104 4117 830 
1134 028 .( 11)756 9854 269 
3240 275 .( 11)553 8917 758 
6720 805 .( 11)408 8248 821 
4431 743 .( 11)304 2417 727 
6292 582 .( 11)228 1813 296 
8862 452 .( 11)172 4036 712 
9716 846 .( 11)131 1767 064 

3776 023 .( 11)100 4757 751 
2862 254 .( 12)774 5007 663 
8393 150 .( 12)600 6332 473 
6168 770 .( 12)468 4939 329 
2622 450 .( 12)367 4462 219 
2103622 .( 12)289 7172 134 
4894 049 .( 12)229 5872 257 
5749 393 .( 12)182 8194 575 
8827 904 .( 12)147 1231 630 
8458 443 .( 12)117 5267 941 
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TABLE VI 


(The numbers in parentheses denote the number of ciphers 
between tht decimal point and the first significant figure.) 


p 

A 

/3 

3,0 

1.000 0000 000 

- 1.361 1111 111 

3 5 

.745 3327 179 

- 617 5664 26 6 

4,0 

619 2696 193 

-.362 6910 127 

4 5 

,536 5078 449 

- 238 2141 749 

S 0 

.475 9358 289 

-.167 1854 290 

5.5 

.428 9313 952 

-.122 7793 517 

6 0 

.391 0671 373 

-. ( 1)932 5031 693 

6 5 

.359 7514 629 

- ( 1)727 0467 546 

7.0 

,333 3333 333 

-.( 1)578 9958 809 

7.5 

.310 6966 019 

-.( 1)469 2645 603 

8 0 

.291 0527 351 

-.( 1)386 0218 617 

8 5 

.273 8253 011 

-.( 1)321 6238 375 

9.0 

258 5812 357 

-,( 1)270 9606 497 

9.5 

244 9877 912 

-.( 1)230 5173 392 

10.0 

.232 7844 334 

- ( 1)197 8161 991 

10 5 

.221 7638 626 * 

-.( 1)171 0732 926 

11.0 

.211 7588 265 

-,( 1)148 9801 195 

11.5 

.202 6327 307 : 

-.( 1)130 5610 007 

12 0 

,194 2728 127 

-.( 1)115 0776 715 

12 5 

. 186 5850 870 

-.( 1)101 9640 947 

13.0 ! 

.179 4905 415 

- ( 2)907 8107 302 

13.5 

.172 9222 326 

- ( 2)811 8410 815 

14.0 

.166 8230 401 

( 2)729 0028 821 

14.5 

161 1439 098 

-.( 2)657 1143 441 

15.0 

. 155 8424 640 

-.( 2)594 4165 421 

15.5 

.150 8818 928 

-.( 2)539 4804 055 

16,0 

146 2300 606 

-.( 2)491 1364 663 

16.5 

.141 8587 811 

-.( 2)448 4212 624 

17,0 

.137 7432 239 

-.( 2)410 5360 674 

17.5 

133 8614 260 

-.( 2)376 8148 358 

18 0 

.130 1938 866 

-.( 2)346 6991 057 

18.5 

126 7232 293 

-.( 2)319 7181 992 

19.0 

.123 4339 187 

-.( 2)295 4734 917 

19.5 

.120 3120 213 

-.( 2 ) 2 73 6258 312 

20,0 

,117 3450 034 

-.( 2)253 8854 120 

20 5 

.114 5607 471 

-.( 2 )236 0035 778 

21.0 

.111 8314 598 

( 2)219 7661 473 

21.5 

,109 2654 341 

-,( 2)204 9879 526 

22.0 

.106 8150 524 

-.( 2)191 5083 451 

22.5 

.104 4726 355 

-.( 2)179 1874 832 

23.0 

.102 2311 723 

-.( 2)167 9032 479 

23.5 

.100 0842 483 

- ( 2)157 5486 730 

24.0 

.( 1)980 2598 324 

-.( 2)148 0297 913 

24.5 

.( 1)960 5097 234 

2)139 2638 202 

25,0 

,( 1)941 5425 829 

-.( 2)131 1776 633 
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TABUj VI — (Continued) 


/O 

c 

o 

3.0 

.388 8888 889 

- ( 1)277 7777 778 

3.5 

,116 7093 913 

-.( 2)581 8684 896 

4.0 

• 0)498 5754 986 

-.( 2)185 1851 852 

4.5 

.( 1)251 6301 473 

-.( 3)727 3356 120 

5.0 

.(1)140 7742 584 

-.( 3)326 7973 856 

5.5 

.( 2)846 3824 237 

-.( 3)161 6301 360 

6.0 

. ( 2)537 1649 335 

— . ( 4)859 9931 201 

6.5 

.( 2)355 7417 128 

- ( 4)484 8904 080 

7.0 

. ( 2)243 8852 284 

( 4)286 6643 734 

7 5 

.( 2)172 0852 322 

( 4)176 3237 847 

8 0 

.( 2)124 4257 604 

- ( 4)112 1730 157 

8.5 

9 0 

( 3)918 7769 007 
. ( 3)690 9857 765 

-.( 5)734 6824 363 
( 5)493 5612 689 

9.5 

.( 3)528 1236 437 

-.( 5)339 0842 014 

10.0 

. ( 3)409 4730 527 

( 5)237 6406 110 

10.5 

.( 3)321 5757 191 

-.( 5)169 6421 007 

11.0 

. ( 3)255 4794 154 

-.( 5)122 9175 574 

11 5 

.( 3)205 1024 695 

-.( 6)904 2245 370 

12.0 

. ( 3)166 2345 586 

-.( 6)674 0640 244 

12.5 

.( 3)135 9108 167 

-.( 6)508 6263 021 

13.0 

. ( 3)112 0109 002 

- ( 6)388 0974 686 

13,5 

. ( 4)929 9691 087 

-.( 6)299 1919 424 

14 0 

.( 4)777 3890 832 

-.( 6)232 8584 812 

14.5 

. ( 4)653 9677 383 

-.( 6)182 8395 204 

15.0 

.( 4)513 3898 749 

-. ( 6)144 7498 667 

15.5 

, ( 4)470 8598 806 

-.( 6)115 4775 918 

16.0 

. ( 4)402 7015 521 

-.( 7)927 8837 607 

16 5 

. ( 4)346 0719 079 

-.( 7)750 6043 467 

17,0 

. ( 4)298 7541 988 

-.( 7)611 0454 032 

17,5 

. ( 4)259 0066 153 

-.( 7)500 4028 978 

18.0 

. ( 4)225 4506 124 

-.( 7)412 1003 883 

18,5 

.( 4)196 9877 124 

-,( 7)341 1837 940 

19,0 

.( 4)172 7369 850 

-,( 7)283 8913 786 

19.5 1 

.( 4)151 9876 810 

-.( 7)237 3452 480 

20,0 , 

.( 4)134 1630 697 

( 7) 199 3279 892 

20,5 

. ( 4)118 7926 279 

-.( 7)168 1195 507 

2 i.o ; 

. ( 4)105 4905 051 

-.( 7)142 3771 352 

21.5 1 

. ( 5)939 3873 877 

-.( 7)121 0460 765 

22,0 

.( 5)838 7409 116 

-.( 7)103 2932 157 

22.5 

.( 5)750 7766 502 

-.( 8)884 5674 819 

23.0 

.( 5)673 6666 593 

-.( 8)760 0821 534 

23 5 

. ( 5)605 8783 224 

-.( 8)655 2351 718 

24.0 

.( 5)546 1216 848 

-.( 8)566 6066 961 

24.5 

. ( 5)493 3070 093 

-.( 8)491 4263 585 

25,0 

.( 5)446 5105 454 

( 8)427 4401 392 
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TABLE VI— (Continued) 


jo 


F 

3.0 

2,988 9351 852 

-.948 1481 481 

3,5 

.875 4725 025 

- 189 2894 604 

4,0 

.370 6973 366 

-,( 1)590 7882 241 

4.5 

.186 3394 083 

-.( 1)229 8587 984 

5.0 

.104 0300 478 

-.( 1)102 7515 921 

5,5 

.( 1)624 7216 370 

-.( 2)506 6826 853 

6 0 

.( 1)396 1994 936 

-.( 2)269 0942 361 

6.5 

.( 1)262 2649 076 

-.( 2)151 5410 433 

7.0 

,( 1)179 7450 169 

-.( 3)895 1744 340 

7.5 

,( 1)126 8009 428 

-.( 3)550 2997 477 

8,0 

.( 2)916 6922 412 

-.( 3)349 9462 528 

8,5 

.( 2)676 8238 027 

-.( 3)229 1312 716 

9 0 

,( 2)508 9783 157 

-.( 3)153 8970 152 

9 5 

,( 2)388 9905 578 

-.( 3)105 7119 802 

10,0 

.( 2)301 5839 811 

-.( 4)740 7668 221 

10.5 

. ( 2)236 8373 742 

-.( 4)528 4390 327 

11.0 

.( 2)188 1526 456 

- ( 4)383 0865 153 

11.5 

.( 2)151 0481 065 

-.( 4)281 7940 194 

12.0 

,( 2)122 4214 851 

-.( 4)210 0557 062 

12 5 

.( 2)100 0884 111 

-.( 4)158 4944 724 

13.0 

. ( 3)824 6707 418 

-,( 4)120 9320 228 

13,5 : 

.( 3)684 8389 962 

- ( 5)932 2623 816 

14.0 

.( 3)572 4725 971 

-.( 5)725 5546 509 

14.5 

.( 3)481 5811 670 

-.( 5)569 6913 075 

15 0 

.( 3)407 5132 846 

-.( 5)451 0040 878 

15.5 

,( 3)346 7368 814 

-.( 5)359 7939 891 

16.0 

. ( 3)296 5444 286 

( 5)289 0976 344 

16.5 

.( 3)254 8421 133 

-,( 5)233 8608 712 

17.0 

.( 3)219 9973 645 

-.( 5)190 3777 170 

17.5 

.( 3)190 7274 117 

-.( 5)155 9046 664 

18.0 

.( 3)166 0170 278 

-,( 5)128 3924 319 

18,5 

.( 3)145 0572 508 

- ( 5)106 2973 088 

19.0 

.( 3)127 1993 361 

-,( 6)884 4715 492 

19.5 

.( 3)111 9198 707 

-.( 6)739 4524 672 

20 . 0 , 

.( 4)987 9413 925 

- ( 6)621 0067 139 

20.5 

.( 4)874 7564 250 

-.( 6)523 7749 416 

21.0 

.( 4)776 8023 679 

-.( 6)443 5733 082 

21.5 

.( 4)691 6366 980 

-.( 6)377 1157 568 

22.0 

( 4)617 6240 310 

-.( 6)321 8064 139 

22.5 

.( 4)552 8493 126 

-.( 6)275 5833 129 

23.0 

. ( 4)496 0674 883 

-. ( 6)236 7999 628 

23 5 

.( 4)446 1499 515 

-.( 6)204 1350 268 

24.0 

.( 4)402 1467 932 

-.( 6)176 5230 172 

24,5 

.( 4)365 3958 471 

-. ( 6) 154 0028 718 

25.0 

.( 4)328 7959 685 

-. ( 6) 133 1 661 235 
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TABLE VI — (Cnntiiuu’il) 


/O 


// 

,1 0 

.(1)703 2407 407 1 

311 *>?12 963 

3 5 

,(2)9% 0214 120 

,(3H31 hjh? 940 

4,0 

1 (2)233 6419 753 

.0)100 21% ]06 

4 5 

.(3)711 2449 363 

.(2) <01 482(1 991 

5.0 

. (3)256 2oJo 166 

(2)100 0241 118 

5.5 

(3)104 2151 284 

, (31442 5904 068 

6.0 

.(4)464 3740 588 

.(3)1 97 M) m 

6.5 

. (4)223 1993 819 

(4)416 lo21 940 

7 0 

.(4)113 8216 820 

.(4)482 3607 431 

7.5 

.(5)610 4831 370 

(4)258 5707 479 

8.0 

.(5)341 8161 061 

.(4)144 7403 415 

8 S 

.(5)198 6271 380 

.(5)840 9223 280 

9.0 

.(5)1192274520 

.(5)504 7006 725 

9 5 

.(6)736 45S7 70S 

.(5)311 7158 600 

10.0 

. (6)466 6351 290 

. (5)19/ 4943 272 

10.5 

.(6)302 4952 338 

•(5)128 0172 024 

11.0 

.(6)200 1684 248 

. (6)817 0785 482 

13.5 

.(6)134 9506 134 

.(6)571 0651 

12 0 

.(7)925 4160 570 

(6)391 5914 775 

12.5 

. (7)644 5499 S52 

.(6)272 7356 xU 

13 0 

(7)455 3925 745 

( 6 ) 1 92 6912 065 

13.5 

. (7)326 0189 201 

.(6)137 9465 952 

14,0 

.(7)236 2653 096 

(7)949 6815 ifi.t 

14.5 

.(7)173 1717 709 

,(7)732 7121 000 

15.0 

. (7) 128 2726 524 

.(7)542 7324 021 

15,5 

.(8)959 5450 485 

(7)405 9879 530 

16.0 

(8)724 4284 089 

.(7)306 5lXx> 664 

16.5 

. (8) SSI 6645 046 

.(7)233 4084 850 

17.0 | 

.(8)423 5236 197 

(7)179 1912 40.1 

17 5 ! 

. (8)327 6400 420 

.(7)138 6225 823 

18.0 

. (8)255 2963 323 

(7)108 0139 227 

18.5 

. (8)200 2846 930 

(8)8)7 3860 468 

19 0 

.(8)158 1422 442 

(8)60" 1)830 o03 

19 5 

(8)125 6316 099 

,(8)531 5324 %i 

20.0 

(8)100 3843 545 

(8)424 7132 721 

20.5 

.(9)806 5374 215 

.(8)341 2548 327 

21.0 

(9)651 4166 128 

.(8 1275 (>048 185 

21 5 

.(9)528 7636 968 

.(8)223 7117 748 

22 0 

(9)431 2539 387 

.(8)182 4566 123 

22.5 

.(9)353 3296 685 

.(8 (149 4578 961 

23.0 

.(9)290 7474 080 

. (8) 123 0102 248 

23,5 

. (9)240 2476 147 

.(8)101 6445 207 

* 24 0 

.(9)199 3121 346 

,(9)843 2535 668 

24.5 

.(9)166 9628 406 

(9)706 388') 111 

25.0 

.(9)138 7382 906 

.(9)586 0755 711 
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TABLE VI — (Continued) 


p 

i 

J 

3,0 

-,( 1)234 9537 037 

. ( 2)178 2407 407 

3,5 

-,( 2)232 9282 407 

.( 3)127 3148 148 

4,0 

-.( 3)408 9506 173 

. ( 4) 169 7530 864 

4 5 

-.( 4)972 9456 019 

.( 5)318 2870 370 

5.0 

-.( 4)282 5435 730 

( 6)748 9106 754 

5 S 

-.( 5)947 9582 728 

( 6)208 0307 432 

6,0 

-.( 5)355 3443 795 

.( 7)656 9391 889 

6,5 

-.( 5)145 5344 260 

.( 7)229 9287 161 

7.0 

- ( 6)641 0133 904 

.( 8)875 9189 186 

7 5 

( 6)300 1017 659 

( 8)358 3304 667 

8,0 

-.( 6)148 0060 623 

( 8)155 7958 551 

8 5 

- ( 7)763 5619 773 

.( 9)714 0643 358 

9.0 

-.( 7)409 7430 989 

.( 9)342 7508 812 

9.5 

-,( 7)227 6566 932 

.( 9)171 3754 406 

10 0 

-.( 7)130 4646 031 

.( 10)888 6133 957 

10.5 

-.( 8)768 7010 766 

( 10)476 0428 905 

11 0 

-.( 8)464 4029 703 

. 00)262 6443 534 

11.5 

-.( 8)287 0085 966 

.( 10)148 8318 003 

12 0 

-,( 8)181 0859 646 

.( 11)864 1846 467 

12.5 

-,( 8)116 4409 022 

.( 11)513 1096 340 

13 0 

-.( 9)761 8900 625 

.( 11)310 9755 357 

13.5 

-.( 9)506 5928 672 

.( 11)192 0731 250 

14 0 

-.( 9)341 8901 625 

.( 11)120 7316 786 

14.5 

-. ( 9)233 9443 041 

.( 12)771 3412 798 

15,0 

-.( 9)162 1522 356 

( 12)500 3294 788 

15.5 

-.( 9)113 7486 502 

.( 12)329 1641 308 

16.0 

-( 10)806 9508 539 

.( 12)219 4427 539 

16 5 

-( 10)578 5246 623 

( 12)148 1238 589 

17 0 

- 00)418 8850 767 

( 12)101 1577 573 

17 S 

-( 10)306 1363 264 

. 03)698 4702 287 

18.0 

- 00)225 7107 282 

.( 13)487 3048 107 

18.5 

- 00)167 8017 503 

( 13)343 3283 894 

19.0 

-( 10)125 7344 857 

. ( 13)244 1446 325 

19.5 

- 01)949 1786 023 

.( 13)175 1472 363 

20 0 

-( 11)721 6269 402 

.( 13)126 7022 561 

20 5 

-( 11)552 3276 496 

.( 14)923 8706 171 

21.0 

- 01)425 4604 167 

.( 14)678 7620 861 

21.5 

- 01)329 7379 935 

( 14)502 2839 437 

22,0 

- 01)257 0422 413 

.( 14)374 2507 816 

22.5 

-( 11)201 4893 910 

.( 14)280 6880 862 

23.0 

- 01)158 7837 578 

.( 14)211 8400 650 

23.5 

-( 11)125 7671 107 

.( 14)160 8415 309 

24 0 

-( 11)100 1019 200 

.( 14)122 8244 418 

24 5 

-( 12)805 1862 542 

( 15)948 6730 535 

25 0 

-( 12)642 9736 393 

.( 15)728 0195 608 
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TABLE VII 


(The numbers in parentheses denote the number of ciphers be- 
tween the decimal point and the first significant figure ) 


£> 


S' 

3 5 

28.751 2015 275 

-2.018 4913 853 

4,0 

1,362 9280 045 

-.639 9553 571 

4.5 

.826 9536 776 

- 286 6103 525 

5.0 

.557 2123 756 

ISO 4344 206 

5.5 

.399 2748 295 

(1)869 6309 477 

6 0 

298 3433 421 

-.(1)537 5591 718 

6,5 

229 9463 348 

-.(1)349 3835 704 

7.0 

.181 5696 107 

- 61)236 2100 221 

7 5 

. 146 2040 440 

-(1)164 8961618 

8,0 

119 6560 886 

- (1)118 2279 748 

8.5 

(1)992 8681 536 

-.(2)867 1239 S12 

9 0 

.(1)833 6752 283 

- (2)648 5413 101 

9 5 

! , (1)707 2793 855 

-,(2)493 4138 461 

10.0 

.(1)605 5349 149 

(2)381 0899 486 

10 5 

(1)522 6404 211 

-. (2)298 3079 274 

11 0 

.(10454 3790 237 

-.(2)236 3306 957 

11.5 

.(1)397 6190 688 

-.(2)189 2706 474 

12 0 

.(1)350 0205 660 

-.(2)153 0798 651 

12.5 

(1)309 7896 316 

-.(2)124 9247 269 

13.0 

.(10275 5429 712 

-.(2)102 7891 576 

13.5 

.(1)246 1999 134 

-.(3)852 1744 589 

14 0 

.(1)220 9073 683 

-.(3)711 4425 518 

14.5 

(1)198 9854 583 

-.(3)597 8026 293 

15 0 

.(1)179 8875 910 

• -(3)505 3397 258 

15.5 

.(1)163 1707 725 

-.(3)429 5745 974 

36 0 

,(1)148 4732 835 

-.(3)367 0820 942 

16.5 

(1)135 4977 156 

- (3)315 2193 163 

17 0 

.(1)123 9979 553 

-.(3)271 9296 956 

17,5 | 

.(1)113 7691 065 

(3)235 6002 537 

18 0 

.(1)104 6396 241 

-.(3)204 9565 434 

18.5 

.(2)964 6512 317 

-.(3)178 9845 869 

19 0 

(2)891 2347 318 

-.(3)156 8723 569 

19.5 1 

.(2)842 5297 627 

-.(3)137 9655 374 

20 0 

.(2)765 3875 595 

-. (3)121 7338 175 

20 5 

.(2)711 3114 773 

-.(3)107 7450 175 

21.0 

.(2)662 2270 210 

-. (4)956 4508 777 

21 5 

(2)617 5695 829 

- (4)851 4293 652 

22,0 

.(2)576 8498 581 

(4)759 9627 856 

22 5 

.(2)539 6419 780 

- (4)680 0595 916 

23,0 

.(2)505 5744 586 

- (4)610 0441 976 

23 5 

. (2)474 3260 497 

-.(4)548 5160 065 

24 0 

,(2)445 5992 272 

-.(4)494 2969 537 

24 5 

.(2)419 1587 794 

- (4)446 3969 384 

25.0 

.(2)394 7661 411 

-. (4)403 9598 359 
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TABLE VII — f Continued) 


p 

C' 

D/ 

3.5 

357 6197 452 

- (1)173 0659 191 

4.0 

(1)798 5119 048 

- (2)282 3837 868 

4.5 

.(1)269 7269 968 

-.(3)732 8051 663 

5.0 

.(1)111 5137 722 

-.(3)241 1381 219 

5.5 

.(2)523 4528 750 

-,(4)925 0558 091 

6.0 

.(2)268 7939 211 

-.(4)396 2769 318 

6.5 

.(2)147 7366 521 

-.(4)184 7282 763 

7.0 

(3)856 9155 732 

- (5)921 2018 141 

7.5 

,(3)519 3903 212 

-.(5)485 5539 780 

8 0 

.(3)326 6117 660 

-.(5)268 1183 076 

8.5 

.(3)211 9254 433 

- 15)154 0554 574 

9.0 

.(3)141 2859 194 

-.(6)916 1750 131 

9 5 

.(4)964 5009 167 

-. (6)561 5350 601 

10.0 

. (4)672 3458 517 

-.(6)353 4749 049 

10.5 

. (4)477 5025 292 

(6)227 8602 485 

11,0 

.(4)344 8411 857 

- (6)150 0558 578 

11.5 

(4)252 8236 420 

-.(6)100 7434 807 

12.0 

(4)187 9176 343 

-.(7)688 3248 748 

12.5 

(4)141 4318 835 

-.(7)477 8804 517 

13 0 

.(4)107 6721 454 

(7)336 6794 370 

13.5 

,(5)828 3962 809 

-.(7)240 4245 028 

14 0 

.(5)643 5779 618 

-.(7)173 8435 375 

14.5 

.(5)504 5238 369 

(7)127 1625 885 

15 0 

.(5)398 8437 502 

-.(8)940 2155 070 

15 5 

. (5)317 7726 592 

-.(8)702 1757 322 

16.0 

. (5)255 0351 049 

-,(8)529 3336 632 

16.5 

.(5)206 0875 950 

- (8)402 5511 467 

17.0 

,(5)167 6061 262 

- (8)308 6648 353 

17 5 

(5)137 1349 784 

-.(8)238 5149 470 

18.0 

.(5)112 8433 182 

-.(8)185 6576 541 

18.5 

. (6)933 5432 357 

-.(8)145 5130 248 

19.0 

.(6)776 2421 574 

- (8)114 7942 689 

19 5 

(6)648 5562 423 

-.(9)911 2100 394 

20.0 

.(6)544 3500 271 

- (9)727 5436 127 

20 5 

.(6)458 8701 286 

-.(9)584 1368 200 

21. q 

.(6)388 4099 727 

- (9)471 4844 101 

21,5 

.(6)330 0128 092 

-.(9)382 4793 533 

22.0 

.(6)281 5293 946 

-,(9)311 7704 559 

22,5 

.(6)240 9925 109 

-,(9)255 3016 408 

23.0 

.(6)206 9976 250 

-.(9)209 9789 105 

23.5 

(6)178 3794 880 

- (9)173 4277 959 

24 0 

.(6)154 1991 740 

-.(9)143 8154 281 

24 5 

.(6)133 6979 531 

- (9)119 7203 889 

25.0 

,(6)116 2537 745 

-.(9)100 0289 166 
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TABLE VII— (Continued 


/=> 


r ' 

3.5 

1.579 8913 122 

- 291 1769 387 

4.0 

.344 9276 62 Q 

- ( 1)455 1504 630 

4 5 

.115 4442 217 

-.( 1)115 8781 297 

5.0 

.( 1)475 1410 972 

- 4 ( 2)377 5757 988 

5.5 

.( 1)222 4862 588 

-.( 2)144 0406 919 

6,0 

.( 1)114 0808 001 

-.( 3)614 9607 748 

6.5 

.( 2)626 4468 334 

-,( 3)286 0508 258 

7 0 

i . ( 2)363 1390 089 

-.( 3)142 4423 327 

7.5 

( 2)220 0141 501 

-.( 4)750 0507 440 

8.0 

.( 2)138 3131 064 

-.( 4)413 8794 788 

8.5 

.( 3)897 2722 669 

- ( 4)237 6853 941 

9.0 

.( 3)598 0994 308 

- ( 4)141 2990 508 

9.5 1 

.( 3)408 2508 260 

-.( 5)865 7917 011 

10 0 

.( 3)284 5633 018 

-.( 5)544 8786 925 

10 5 

.( 3)2020841107 

-.( 5)351 1847 710 

11.0 

( 3)145 9325 974 

-.( 5)231 2393 114 

11.5 

.( 3)106 9873 276 

- ( 5)155 2312 455 

12.0 

.( 4)795 1834 088 

-.( 5)106 0518 396 

12.5 

. ( 4)598 4598 613 

-.( 6)736 2301 182 

13.0 

.( 4)455 5973 535 

-.( 6)518 6638 399 

13.5 

. ( 4)350 5159 780 

( 6)370 3629 282 

14.0 

.( 4)272 3103 032 

-.( 6)267 7874 363 

14.5 

.( 4)213 4710 252 

-,( 6)195 8739 375 

15.0 

( 4) 168 7544 633 

-.( 6)144 8213 551 

15.5 

.{ 4)134 4512 878 

-,( 6)108 1535 449 

16.0 

.( 4)107 9058 431 

( 7)815 2967 606 

16.5 

.( 5)871 9546 641 

-.( 7)620 0115 559 

17.0 

.( 5)709 1357 523 

-. ( 7)475 4002 735 

17.5 

.( 5)580 2104 892 

-.( 7)367 3519 091 

18.0 

.( 5)477 4317 347 

-.( 7)285 9398 751 

18.5 

.( 5)394 9737 353 

- ( 7)224 1091 135 

19.0 

.( 5)328 4199 721 

-.( 7)176 7967 053 

19 5 

.( 5)274 3965 893 

-.( 7)140 3360 406 

20.0 

( 5)230 3075 611 

-.( 7)112 0487 209 

20.5 

. ( S ) 194 1416 638 

-.( 8)899 6217 202 

21.0 

( 5)164 3306 063 

- ( 8)726 1234 346 

21 5 

.( 5)139 6233 668 

-.( 8)589 0458 851 

22.0 

.( 5)119 1105 861 

-. ( 8)480 1471 084 

22 5 

( 5)101 9599 333 

- ( 8)393 1799 685 

23.0 

.( 6)875 7714 928 

-.( 8)323 3791 671 

23.5 

.( 6)754 6921 922 

-,( 8)267 0876 406 

24.0 

.( 6)652 3888 702 

-.( 8)221 4825 166 

24.5 

.( 6)565 6513 791 

- ( 8)184 3745 724 

25 0 

,( 6)491 8478 631 

-.( 8)154 0485 226 
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POLYNOMIAL APPROXIMATION 


TABLE VII— (Continued ) 



Cr 

H' 

3,5 

,(1)143 3986 442 

.(1)543 8043 981 

4 0 

.(2)165 38S2 513 

.(2)616 8981 481 

4 S 

.(3)325 3719 022 

.(2)120 1878 234 

S 0 

. (4)847 0633 624 

.(3)311 2518 155 

5.5 

.(4)264 7923 510 

.(4)969 9931 081 

6.0 

,(5)944 9259 723 

.(4)345 4902 916 

6.5 

.(5)373 3302 227 

.(4)136 3314 118 

7.0 

.(5)160 0116 167 

.(5)583 8397 737 

7,5 

.(6)733 3626 200 

.(5)267 4300 767 

8.0 

.(6)355 6040 392 

.(5) 129 6221 514 

8.5 

.(6)180 9471 484 

.(6)659 3767 449 

9 0 

.(7)960 0363 156 

. (61349 7616 94'' 

9 5 

,(7)528 3674 230 

. (6) 192 4628 42 1 

10.0 

.(7)300 3768 815 

.(6)109 4008 386 

10.5 

. (7) 175 7761 725 

.(7)640 1325 618 

11.0 

(7)105 5698 126 

. (7)384 4277 647 

11.5 

.(8)649 1137 736 

.(7)236 3567 152 

12.0 

.(8)407 7298 452 

.(7)148 4557 926 

12.5 

.(8)261 1497 436 

.(8)950 8148 611 

13.0 

.(8)170 2826 250 

,(8)619 9571 695 

13.5 

.(8)112 8752 998 

.(8)410 9394 710 

14.0 

.(9)759 6818 749 

.(8)276 5670 073 

14.5 

.(9)518 5449 520 

(8)188 7758 141 

15.0 

.(9)358 6183 357 

. (8) 130 5524 791 

15.5 

.(9)251 0639 560 

.(9)913 9675 989 

16 0 

.(9)177 7849 669 

.(9)647 1965 291 

16.5 

.(9)127 2480 223 

.(9)463 2203 741 

17 0 

.(10)919 9584 852 

.(9)334 8890 426 

17 5 

.(10)671 4105 266 

.(9)244 4091 600 

18 0 

.(10)404 3960 450 

,(9)1 79 9705 267 

18.5 

.(10)367 1241 847 

.(9)133 6401 141 

19,0 

.(10)274 7920 816 

.(9)100 0289 750 

19.5 

.(10)207 2366 516 

.(10)754 3731 21 4 

20 0 

.(10)157 4099 489 

.(10)572 9940 142 

20.5 

.(10)120 3776 663 

.(10)438 1898 319 

21.0 

.(11)926 5399 250 

.(10)337 2711 214 

21.5 

.(11)717 5522 633 

. (10)261 1964 982 

22 0 

,(11)558 9721 931 

.(10)203 4711 96 6 

22 5 

.(11)437 8836 453 

.(10)159 3934 600' 

23 0 

.(11)344 8668 374 

,(10)125 5342 827 

23.5 

(11)273 0031 904 

.(11)993 7516 417 

24.0 

.(11)217 1767 993 

.(11)790 5382 130 

24.5 

,(11)173 5819 878 

.(11)631 8492 407 

25.0 

.(11)139 3623 589 

,(11)507 2869 968 


H. T. DAVIS 


195 


TABLE, VII — ( Continued' 


3.5 -.( 2)270 9986 772 

4.0 -.( 3)227 3478 836 

4.5 -.( 4)343 6965 064 

5.0 -.( 5)713 2482 623 

55 -.( 5)182 5352 461 

6.0 -.( 6)544 3210 423 

6.5 -.( 6)182 6693 153 

7 0 ( 7)674 0025 445 

7.5 -.( 7)268 9333 213 

8 0 -.( 7)114 6212 362 

8.5 -.( 8)516 9083 906 

9.0 -.( 8)244 8220 580 

9.5 -,( 8)121 0509 065 

10.0 -.( 9)621 7703 060 

10.5 -.( 9)330 4159 767 

11.0 -.( 9)181 0370 007 

11.5 -.( 9)101 9713 676 

12.0 -.( 10)588 9829 883 

12.5 -. ( 10)348 0938 563 

13.0 -.( 10)210 1044 796 

13.5 -.( 10)129 2993 719 

14.0 -.( 11)810 1043 368 

14 5 - ( 11)516 0618 886 

15.0 ( 11)333 8664 755 

15 5 -.( 11)219 1292 642 

16.0 -,( 11)145 7726 865 

16 5 -.( 12)982 0445 539 

17.0 -.( 12)669 4697 055 

17.5 -.( 12)461 4992 604 

18.0 -.( 12)321 4946 024 

18.5 -.( 12)226 1959 235 

19.0 -.( 12)160 6464 099 

19.5 -,( 12)115 1112 743 

20.0 -.( 13)8318162819 

20.5 -.( 13)605 9221 411 

21.0 -.( 13)444 7507 764 

21.5 -.( 13)328 8288 593 

22.0 -.( 13)244 8106 616 

22.5 -.( 13)183 4686 278 

23.0 -.( 13)138 3685 504 

23.5 -,( 13)104 9876 917 

24.0 -.( 14)8012235814 

24.5 -.( 14)614 8704 752 

25.0 -.( 14)474 3701 124 



.( 3)135 1095 994 
.( 5)844 4349 962 
.( 6)993 4529 367 
.( 6)165 5754 895 

. ( 7)348 5799 778 
. ( 8)871 4499 445 
. ( 8)248 9856 984 
. ( 9)792 2272 223 
. ( 9)275 5572 947 
. ( 9) 103 3339 855 
( 10)413 3359 421 
.( 10)174 8728 986 
.( 11)777 2128 825 
.( 11)360 8488 383 

.( 11)174 2028 875 
. ( 12)871 0144 373 
.( 12)449 5558 386 
.( 12)238 8265 393 
.( 12)130 2690 214 
.( 13)727 9739 432 
.( 13)415 9851 104 
( 13)242 6579 811 
.( 13)144 2831 239 
.( 14)873 2925 919 

. ( 14)537 4108 258 
.( 14 ) 335 8817 661 
.( 14 ) 212 9981 931 
.( 14)136 9274 099 
.( 15)891 6203 434 
.( 15)587 6588 627 
.( 15)391 7725 751 
.( 15)264 0206 484 
.( 15)179 7587 394 
.( 15)123 5841 333 

.( 16)857 5225 577 
.( 16)600 2657 904 
.( 16)423 7170 285 
.( 16)301 4909 626 
. ( 16)216 1633 317 
.( 16)156 1179 618 
.( 16)113 5403 358 
.( 17)831 2774 587 
.( 17)612 5263 839 
.( 17)454 1098 277 



THE PRECISION OF THE WEIGHTED AVERAGE 


By 

H, Mimcer Grtjzewska, Fh, Dr,, 
Warsaw, Poland , 


Introduction , We shall consider an infinite universe of el 
ments characterized by pairs of variable quantities ?c L , 

3 Regarding the values of y £ < as the weigh 

to be assigned to the variates x L the weighted average of m; 
be denoted by } i.e. 




% + ~ 


All possible samples, each of A/ pairs of variates Xj 9 y t f 

that can be selected from the universe constitute the sample popi 

lation, 

Our problem is to obtain an expression for the probable pr 
cision of the weighted average x y according to certain hypothes 
concerning the selection of the pairs of variates in various sample 
Professor Bowley discussed this problem in his paper on “Precisu 
of Measurement Attained in Sampling” 1 presented in Rome durii 
the Congress of Statistics 1925. In this paper Professor Bowlf 
made no allowance for correlation between the variates X £ and y 
In the present paper I shall attempt to eliminate this restrictio 
I am greatly indebted to Professor A L. Bowley for suggestio 
regarding the simplification of the proof of theorem II and for h 
general assistance in improving the form of this paper. 

Let us suppose : 

(a) the pairs of elements selected from the universe are indepei 
dent of each other, 


Cambridge 1925. 
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(b) the number of pairs in each sample is so large that ~ may 
be neglected, 

(c) the frequency surface f(x,yA s normal, i.e. the probability 

that the particular pair will be selected is, 


> 2 ?* — £ 

* <5. 




Jrficj-xXy -y 




where x / y an d r designate the parameters character- 

izing the surface, 

(d) the a priori chance that the parameters of (c) are equal to 
given values may be defined by the function/ 5 ?^, y ^ <£,, r) 
where this function is integrable, can be expanded in Taylor’s 
series and converges over the whole space. 

Let the calculated characteristics of the sample be, 

Xy the weighted average of with y* as weights AS**/,*4 4 — 

Y arithmetic average of the variates y it ( /- 1,2,3,"- -,N) 
X the arithmetic average of the variates x^( » » ) 

3# the standard deviation of the variates x^f » » ) 

Sy the standard deviation of the variates y t / >, t) j 

P the coefficient of correlation between the variates x^ and y 

The expressions representing the most probable values of the 
weighted average and its standard deviation are independent 
whether the parameters of the universe are known or unknown. 
In Parts I, II, and III we shall consider the respective cases, 

(a) when all parameters are unknown, 

(b) all but y are unknown, 

(c) all but y and <?y are unknown. 

In Part IV we shall consider the generalized case of Part I 

when there are H sets of elements, i.e. 


f*U Ac 

jsJ.J.3 
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in the universe. In order to consider this case we shall, at the be- 
ginning of Part IV, slightly change the hypotheses and modify the 
above notation. 


PART I 

Case Where All Parameters Are Unknown 

Theorem (1.1). If hypotheses (a) and (c) are satisfied anu 
if SyfJ- O then, the most probable value of « Xy. 

Proof. If jQ denotes the probability of getting N particular 
pairs of variates, then it follows from hypotheses (a) and (c) that. 



N 


N 




Sjf+fa-X) S*+fy-Y) 




°y 


■£r 


QSxSy+fe-XXy- Y) 


Taking the partial derivatives of with respect to x,y <%,, c/y 
and r , setting them equal to zero, and solving for x, % cp 
and r , yields 


(2) 

x*X, 

y- Y 

cr x = S *' r *j? 
a y " 3 y> 

hence 

x*x, y-y; 

<y ;c =5 )i , and r - P will 


make a maximum, and the maximum value of ^ is, 


( 3 ) 


P 

rr?<3% 


1 


eS x J y JrrfTP* 


/V 
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The weighted average and X y can be expressed in terms of 
x.y, Cy, r and X, X S„, 5 y} P respectively, 

// 

Xy=E (x* ■ y^)/y £ (by definition) 

A/ 


J £ 


N £*i y ‘ -^-^(since 4,Zv,*Y by definition) 

w yr m n i v j t 


hence, 


Z?Sj.S 

X* — —*■ + 


m 


y y 

similarly. 


> X 


^i£wxr- e s*Sy) 


r<fy<f w 


This proves theorem (1.1). 

Theorem (12). If all four hypotheses are satisfied and if 
S x Sy (T- X?)4 O then the a posteriori probability P that the 
sample came from the universe, the weighted average of which 
satisfies the inequality | x y - X y \ * € , can be expressed by, 


(5) 


v. 



where 
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Proof, It follows from (4) that, 


x~ x - x. y -Xy- £f± 5 y 

y Y 

Substituting the above value of (x.-X)m (l) we shall have, 


( 6 ) 


1 -V N 


P = - J e zn-X*) 

n I o x Oy XrnJJYr^/ 


w 


where 

Y\* 


W- V 2 +/Y^} +f— Jr/C + (J. r Z)fYlX.) 

I ^i./ ( o y X cr^o-y { Oy y 


and 


. V- - & ^X)r r/YlX+ 

V o'# t i / ( ay y / 


Let: 


■Xy Xy = d Sy. , C? X - Sy * A Sy 

y- Y~ <x"jy, ay-5y* X' J y 




then, 






i )a+ a 


A/ 

e 


N 

£p-(Py>J*] 


in which 


(7 M ■* ^ 4 -^-, 


and 
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Taking the Ingaiitlnn of - — we shall have, 

/ T77^^ 

~ p '' >n ax „ , 

N to #-fr~- A > where 

0 . A -const + fogfl+A,) 

\p) f 




Expanding /) in tern' of the small quantities A'A" d'df'p 
to second powers ) t ausive and letting K* obtain 


i {o $ j 5 £*~ a »+** where 

1 s, t dA dA I rj) 

2 Ap const + 

V [ /fd'- 2f<Ct-fC e Jd%Cl-/<*Jp 2 

+ ~ 7 l-P s j~~ [_ CJ-PVfl-KV 

(8 1 ) 1 -P* l-P z +K*V+?*)_ 

j+ir i b-* 9 tt : x*2Lf ^ci-tfVd! y a* 

* 'Yilti’V i+x^-PYl-kyh 


_ct* 

/ jtK*{l-te z (l kV\ 


(Wc shall make ubc of the above substitution in the next paragraph, , 
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Therefore the probability of getting a particular set of N pairs of 
variates can be expressed approximately by, 

(9) P„ - a const times e A/ '* / 

Then it follows from hypothesis (d) and (8') that the a pos- 
teriori probability P that the sample came from the universe — 
the weighted average of which satisfies the inequality 
whatever the parameters and r* ma Y be — is expressed 


(10) P- 


JL J “"/ F r* y ,- -r)e 

.Xy-e -<* -f y 




at S *" atr* 

x y 


■to -/ 


rje 


-*A/ f A? ) 


d# : d r 

Xy 


We may write, 




- A/fa, +-4j) 


rj<? 


/- Ndf_ 




-A/fXy-Jfy) 


where 


<5 - since f r d y ) - d, 


t t*y. k < 5 ,, rt *$. r J e 


-a tfal/2+4i) 
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[ Let, [Nfxy-Xy) 
and 



( 12 ) 


then, Pa , y, , 

<ef77 'Ws 

It follows from (8'), (11), and (12) and hypothesis (d) that 
t(w) can be developed in Taylor's series for all values of A/, 
hence, 


(13) 


‘ ~] = fa 


Neglecting terms of order of fy/ J we shall have, 

efi/ _/e j* ^ _/£ \* 

(i4) p=f \<?foj+s <W1 * ^ acce 

*L <£•/*/ ' -ruo 


r Ygf 

but, / t?e ' o/ cf 

J eJ77 

e { Jrr* 


oa 

o* j-e 

-00 


(odd function) 


and 


hence, P* 


/*<z 


■^Yi) 


ctf 




Let c/-= X=- and E*y/Ni 

{177 


then P 


i 


^TfZ 


~~m< 


r<< % 


’dt 


This proves theorem (L2) 
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PART II 


Case W here y is Constant 1 


Theorem (2,1). If hypotheses (a) and (c) are satisfied and 
if 

f (la) oj — + 


(l.b) 


(lc) ^ Kf)/f/+ PK 7 
(l*d) JC y +<r,f- 0 \jr+y fy-kj' 


where of f a£ f and >c 0 arc the most probable values of 
and ay respectively, and, 


( 2 ) 




K - 


* - 


y- Y 
s r 

y-y 




Proof. The probability of getting N particular pairs of vari- 
ates is given by (6) of Fart I. Taking the partial derivatives of 
with respect to eg, cjy n and , and setting them equal 
to zero, we obtain, 


( 3 ) 


(3.a) 

(3.b) 

(3.c) 


^•2fJ.rV*<r y W^O 

- • Za-rV-SrW-rf-rVWr =<? 


t (3d ) 

l Case wheie all the parameters but y are unknown. 
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where Wj . , Wj and U/ / mean the partial derivatives of W 
x y 


with respect to &y and r respectively, 
But, 

vj -z{3L.) 2 +2rQ 

^ ^ (aj cx^Oy 


°Wy 


wj = 2 V 0 y Vi . - //&)*+ *? r/p ik&. z(l-r s /~j 
<7y y v y I cfc/ 0 * <3^ I CTy/ 

w' r = Z VV'-ZP ~2rf^.y 
r a *<*y 1 cr y / 


since V= O , we obtain, 


( 3 ') 


(3',) (1 -rV-f%)\rP || .* 

(3-.b) rP | ±-«-r*)(rf) Z -0 

C3'.c) ju 


Uy/ J * ^ Oy, 

Solving for cjt ^ and a 1 from (3) and making use of the sub- 

stitutions from (2) we get the most probable value of cr and r t 

** y 

r I 

= cr, 


(4)i 


^ V 


cty-cfe = Sy Y + 't* 


( 4.a) 

(4 b-) 

(4.c) 

and from (3.d) we obtain, 

(4.d) # y - =Xy - + r a cf t f- 'k+k, J 


= & yj+*?j /fJ+Ptt*) 
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(4'.b) S y = <55 (7-p 

(4'-c) P. r a jfl- k ?)/(!- Pp< V 

hence, 

PS^Sy = <^< 3 / 2 , fl-k,V 

Substituting the above value of P Jy in (4.d) we obtain, 
(4'.d) x, y = x 0 =Xy + cr y r a k /■ <f, ~ kf) 

This proves theorem (2.1) 

If we denote the maximum probability by ^^then, 



Theorem (2 2). If all four hypotheses are satisfied and if 
S# $ yCT^JfO X hen. the a posteriori probability, P that the 
sample came from the universe, the weighted average of which 
satisfies the inequality | x y - Xy | ±6 , can be expressed by, 

( 6 ) 


Proof. Let n^t-r^k,, >? z O then by substituting the 

values of S Xf Jy and p from (^.a), (4bb) and (4'.c) we get, 

2 In this case the function t = ‘fy y >y,d Xtl dy t r ) in (d) is Ffcy, r ) 
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hence, 


(8) JL-/*- 

P ma x l ^ 


<3^ ££ /i'/^ 


-^77^ 1 ‘S' V • / " / * 


. ^ -V 
- 

P 


A/ 


~ 2 {l-rV 


iv 


( 9 ) 


“v ✓ P 

raking the logarithm of lettmg 

r*fr 0 +/=) t 
Cjy.^V+JV, > 6 y. *>**,<* 
we shall have, 

i Pma# _ ^ where 

/v *v -^r ' 

- <w +dpa*XMyQ*M*f ty [ fc^J ^ * 

Expanding ^ in terms of the small quantities ^ *' and / 3 
we obtain, 

J. ft* const +A,+A Z where 

' \ L „ 

► j£j£ , „ y^w*J| *&**•* %-*?&**' )eiA 

■Z(k+k, )<*/° -Zrfa-tfM A 
-ZrTl-kfk+Zk,^ X'/O-Zto {A-*, )* A’j 111(1 


[ 10 ) 


„ J A'dA ^ +A 2 ±L) 
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The expression representing the value of A, is quadratic in 
form in terms of the variables X, A/d'and /d where all the co- 
efficients are positive, 

, (k,+k)d'+r 0 {l-kVd''i-\i-k(k+2k,)\ X z 

'* JO-rf) 1 ^ Y-tf+^kfk+tk,) / 

, rM f, ". WHYM/s-rXA 

(n) Xl-XM-r/+r/k{k^Al ° 2\l-r 0 *+r 0 ‘k'(l.r 0 *k ,*) J / 

(t~rfX'l+kk,) Z +Cf+$M*(i-k?)( / o [k+k / -r 0 i k l (J+kk l J\ a' 1 
* P\l-rXr/m7XXJ\ {/-/ (l-r o Va+Uk^fUr/JXO-k?)) 

d fZ (I~k*) 

+ ^ [(l-r/)(/Xkk, )*+fi+ r, VkVy-k/J] 


For the rest of the proof of this theorem we proceed as in 
Part I and can obtain. 


( 12 ) 


_ a > 

a Y 177 


k(7-hnV+(-t-r 0 *) 


(1 t t -h' k 

7XX~JT 


Notice that if y= Y then. 


K,*k r O , o-, = 3 x , -S x , <? - £> and X 0 *JY y 
(13) ^ Y sa Y' = ^N~ 


hence CTy, <C if 4 O where cr is given by (5) Part I, 


PART III 

Case Where y and dy are Constants 3 

Theorem (3.1). If hypotheses (a) and (c) are satisfied and 
if J x S y (1'PfyO then, 


8 Case where all the parameters but y, dy are unknown* 
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^ X. / rj 


a, = P V/ * O-P 2 ) Sy 2 


sy r ^ 


OM (H>) 


/ J &**yW/-WJy* 


( i x) ^ - r v - *&*x(£Z<.* )**+*$£ 

X oy I &y ys a 


L v ‘* ' ° y Y Sy ( *£ r -?S w -a- 

where 4, Q and x o are the most probable values of a^, r and 
respectively and 

3*-**'; 2**a; * c 

Y a- y cry y 

Theorem (32 ) . If all four hypotheses 4 are satisfied and if 
Sy, 3y (i~ then the a posteriori probability P that the 

sample came from the universe, the weighted average x y of which 

satisfies the inequality J - X y | < £\ can be expressed by, 

(f t z 

, 0 s P* — - f e * a Y, di where 


'(t-rf) U+( 


Notice that if y- Y and cTy* then , 


Of* 3 ^, q » P ; * 2 Cy and 

cr Y/ = Oy - .Jf? 

hence cfy < &y* if PfO 
where &yi is given by (12) Part 1L 

As the proofs of theorems (3,1 ) and (3,2) do not differ from 
the proofs of theorems (2,1) and (2,2,), we shall omit them * 

4 In this case the function F(x yt y, <fx,<* yt (d) is Ffoyt#*' r ) 
*Part I and II were presented in Wflno during the II Assembly of 
Polish MuthemntH wns. 
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PART IV 


In this Part we shall consider the generalized case of Part I 
where there are k sets of elements characterized by pairs of vari- 

, , . . v / 

able quantities, x- , y i 

fc j 

£ x y Ay k a An 
Let, — 

ZAy> * A 

where Xy is the weighted average of the variates x- , with yy 

as weights, and Ay the sum of these weights. Our problem is to 
obtain an expression for the probable precision of the quantity x 
according to certain hypotheses. 

We shall replace hypothesis (b) of the introduction by hypo- 
theses (V ) and (b" ) where, 

(b ' ) the* number {N = N l *-N 2 +N 3 * 0 f pairs in 

each sample is so large that ~ may be neglected, 

(b " ) each of the numbers N(( {=l t Z,3. •••-, k) of pairs 
from separate sets is so large that^has a significant value, i.e., 

Nf 

N 

' / 

Let us replace in hypothesis (c) Pf by P ( ) 

and x, y J a#, cs yj r by xf 'yi an d refer to the 

corresponding general hypothesis by (c ' ). Likewise if in hypo- 
thesis (d) we replace Ff x yt y, d x ,d y , r by Ff *y, yfffj, dy, r f) 
we obtain the generalized hypothesis (d 7 ). 


ft* 1,2,3, , k 1 

[(*1,2,3, , oof 
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Wo *diall denote the calculated characteribtiqs of the sample by 
Xy, xfy U*. Sy, p, (i -l 1 3, ■ ■ • i Nf) 
corresponding to the values Xy t X t J yj as defined 

in the introduction page 197. 

Theorem (4.1). If hypotheses (a) and (c 1 ) are satisfied 

j> j> 

and if (1-T2) S x $y 4 O then the most probable value of a: is X 
where, 

X=ZXy 

/ A 

Proof*. Let be the probability of getting a given set of 

a 

N pairs of variates , y t , then it follows from hypotheses 
(a) and (c 1 ) that, 



*The proofs of the theorems (4.1) and (4.2) shall be given in very 
abbreviated form as the method of proofs of these theorems does not differ 
from the proofs of theorem (2,1) and (2.2) of Part I. 
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x-X = D , Xy -Xy = Sjfdj ; 




Oy zSy (I+Pif); "fyfl+pg) 


f*U ft 


PA# Jy 

3 x ‘a " *V ; Yg ‘ K P 


D * f “/ d e 


and we win also express the unknown quantity cl» *(£=I,Z,' ,k) 

in, terms of D and the independent variable T)^A=i,2A • •, k-l) 
as follows, 


d f *«c^ \fr ' ^ f]‘ * m i,2 ‘ ; 




Hence it follows from (I) that, 
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k 

P=TT . 
” f=l 






hrrSjsS 0+^)0+ A}) \i-Vf(U P f)^ 


where 


(5) 


w.*v?+ 1 


-2P. 


2 J'W 


*' * (l+A'f)* U+ty* Ul+ApU+A'P 


KM^/ 


and 




where are to be found from the equations (3) and(&-££"‘A^ 
Taking the partial derivitives of with respect to D and 



we obtain, 

* 



dPn m 1*1 

dPn 



6 D * k, * k 

ddf 


(6) 

dPn 1 6 Pn 

1 

dPr? 


drp/% dd k 


ddf 


f*l4 ' -k-J 


It can be easily vertified that = — — *o if 
dPr? 


dD dr? t if and ° nly if 


dee 


*0 


The probability P „ treated as the function of variables 

A 1 ?i'--V k . 1 . 6 i 2 - 6 MI A' ll --K'K a maximum when, 

& * V, * • %./ * • Ap A / 1.2, • • ft) 

This proves theorem ( 4.1 ). 
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Theorem (4.2). If all hypotheses are satisfied and if 
then the a! posteriori probability that the sample came from the 
universe, the quantity ^ of which satisfies the inequality jx-Xj< £ 
may be expressed by, 

€ _tf 

p a f £a z where 




~yf) " k ) 


Proof . Let P ma # denote the maximum probability/ then it 
follows from (6) that, 


^ k i 

£L, y * * Tr ; 

max i / „ _ A 


* N t 


fn 

N k 

p e € 

rr 

max 

/ 


h ~ 

= € 

77 


/ 


Jt-v} 





r}/W» 


z \L-P$(t+fy) Z 3 




e *\*-*t(*+*4r 


where the value of Wf given by (5) and w/jp 

p 

As in Part I or Part II if we expand the /oa^—in terms of 

7\ f k f k 7 ' tfi a J6 

E>,r} r j.» the first term that does not vanish 

is quadratic in form in terms of the variables, 

D, T)., ; <5,,, ; A',-- i SiJ ’<°/r ! 
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and this in turn by linear transformation can be expressed as, 

(9) 1 C f * "T - ^ * $ , 

£ !x when 

/ UJj 

. ^ i + {i-k* ] and 



To complete the proof we proceed as in Part L 



ON CERTAIN RELATIONSHIPS BETWEEN AND 
Q FOR THE POINT BINOMIAL.* 

By Margaret Merreu, 

1 , Introduction . 

The extensive literature on the point binomial covers studies 
on a variety of its properties, aprons of its use as a discrete prob- 
ability function and of its approximate representation by certain 
continuous curves. Investigations on such properties as the sum of 
its terms within specified limits, the ratio of its ordinates, and the 
slope of chords connecting successive ordinates have thrown light 
on the characteristics of the point binomial and have suggested 
various continuous functions as substitutions for the binomial ex- 
pansion. 

Prominent among such studies have been those dealing with 
the moments of the binomial and it is with these properties that 
the present paper is concerned. The first four moments have been 
used by Pearson 1 as a means of fitting a point binomial to observed 
data and he has pointed out 2 that these moments expressed in 
terms of /3j and approach the corresponding moments of the 
normal curve as r? becomes indefinitely large. Other papers that 
especially concern the following discussion are one by Student 3 in 
which he discussed the relationship between and /3^ for the 
point binomial and the Poisson exponential series, and one by Lucy 
Whitaker, 4 in which the range of and /S, and certain relation- 

*Paper No, 175 from the Department of Biostatistics, School of Hy- 
giene and Public Health, The Johns Hopkins University, Baltimore, Md. 

iPearson, K. Skew variation in homogeneous material. Phil Trans. 
Vo 1 186 A, (1895), pp. 343414. 

‘“Pearson, K, On the curves which are most suitable for describing- the 
frequency of random samples of a population, Biometnka, Vol. 5 (1906) 
PP. 172-175, 

8 Student. On the error of counting with a haemacytomcter. Biometrika, 
Vol 5 (1906), pp. 351-360 

4 Whitaker, Lucy. On the Poisson law of small numbers. Biometnka 
Vol. 10 (1914), pp. 36-71 
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ships between the moments and the constants of the point binomial 
were discussed. 

The present note will give some additional relationships be- 
tween the third and fourth moments of the point binomial, in terms 
of and /S? , and will discuss these relationships in connection 
with their bearing on the association of the point binomial and the 
normal curve. The point binomial, fp+j)* 7 \ has of course two 
constants p and n which completely determine its characteristics. 
Certain of these properties are closely connected with and/S, * 
and it is therefore of interest to see how /S 1 and /3^ change as p 
and n take on different values In order to see the effect of varying 
each of the constants, the relationship between /g. and /S^ will be 
detei mined for varying values of r? when p is held constant, and 
for varying values of p when n is held constant. In addition to 
these relationships, it is possible to see how the /3s are related 
when both p and r? are allowed to vary while certain functions of 
these parameters are held constant. In the following discussion the 
relationship between yS^ and /S^ will be considered for the cases 
where the mean, np , is held constant, and whete the square of the 
standard deviation, 7 i pg» is held constant, 77 and p being variable. 

The moments of the point binomial fp-h^-J 77 are: 


mean - np 

- npq 

M4. - 


These moments lead to the following values of the y3-s- 


( 1 ) 




(9yPJf 

r>pg- 


4 - 


1 + 


( 2 ) 
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Although the point binomial is ordinarily applied only to the case 
where vO , cf f and r? are positive, it should be noted that thes 
formulae for and are not limited to this case. The onl 
limitation on these constants is that jo a 

2. The relationship between anc ^ f or constant values of /a 
If we eliminate r> between (1) and (2) we obtain an equatio 
relating > /3^ , and/D , bemg unspecified. This equation is 


(3) 


/ 3 ^ - E3 + — 

* ^9 -py 







FIG J THE RELATIONSHIP BETWEEN ft, AND fe FOR POINT BINOMIALS HA 
CONSTANT VALUES OF p AND CONSTANT VALUES OF n 




MARGARET MERRELL 


219 


or fixed values oip , this represents a family of straight lines 
— d5 p><2- 

ith slope f , all passing through the position 

e position of the/?sj for the normal curve. Figure 1 shows a 
oup of these lines on the /S* plane, for the values of p indi- 
ted. The various positions on these lines represent /Q and/<^ 

>r point binomials having the specified pis and varying r? is . Only 
ose values of p which are between O and J are included in this 
agram, and these only for positive values of 77 , since these values 
>ver all the ordinary probability problems. From (1) it can be 
en that for such values of p and 77 , /3j is positive The /$s for 
nomials having parameters outside these limits will be discussed 
a later paragraph. 

The range through which these lines can swing can be deter- 
ined by substituting the limiting values of p in equation (3). 
he values of p to be used in determining this range are O and . S, 
)t O and / „ This follows from the fact that, since p and are 
terchangeable in the point binomial, and the slope of the lines 
ven by (3) is symmetrical i np and ^ , the lines obtained for p 
•tween ,J and J would be identical with the lines for the com- 
ement of p, jf-p y between O and . J. Thus any particular line 
presents point binomials for two complementary values of p . 

If we substitute the value J for p in (3), the equation 
icomes 

0 /<?-<? 

, we would expect, since point binomials having a p or .J are 
Tnmetrical. If p = 0 , equation (3) becomes 

5 ) 

he latter line is identical with the line giving the relationship be- 
tween and in the Poisson exponential series. This is in 
irmony with the derivation of the Poisson exponential as the 
miting case of the point binomial as p tends to O , and r? tends 
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to cv? , r?p being finite. If we denote the mean of this series by 
77p , the moments are’' 


//? - *?? 


Mj - 




0 * 71 * *77 


and 

J 

~ 777 

A> =7+ ~j * 

We have thus the equation relating and as given by (5)/ 

The radiating lines giving the foi point binomials having 
values of p between O and / , will therefore lie in the range 
between the veitical, = and the Poisson exponential line- 
This range, which is indicated in figure 1, was pointed out by Lucy 
Whitaker 0 in the paper mentioned above. The Type III line is 
included in thus graph to indicate that portion of the plane 

covered by this family of lines. It is of interest to note that /Sj 
and /S^ for skew binomials do not approach the^dW of the Type 
III curve, although Pearson 7 has shown that in an important slope 
property the skew binomial polygon and the Type III curve follow 
the same law. 

3* The relationship between and for constant values of n. 

The pZj equations for constant values of /o have been ex- 
pressed as continuous straight lines, but only certain positions on 
these lines pertain to binomials having integial values of 77 . These 
points are determined by the intersection of these lines with the 
curve relating /3j t > and r ? , when r? is held constant at integral 

^Student. Loc at. p. 353, 

°Whitaker, Lucy. Loc. cit p, 37 

7 Pearson, K. Skew variation in homogeneous material. Loc. cit. p. 357. 
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values. The equation of this curve given by eliminating /d and ^ 
between (1) and (2) is: 

( 6 ) 

This is a family of straight lines parallel to the Poisson expo- 
nential line, a specified value of r? determining a particular line. 
The intersection of any of these lines with any Ime determines 
the /9*s\ox the point binomial of specified and r ? . Figure 1 
gives the graph of three such lines for 77--^ , and 

From this graph it can be seen that even with an 77 as small as 
/3j and /3^ for the symmetrical and slightly skew binomials are 
not far from the position of the of the normal curve, but for 
the highly skew binomials, they are quite far from this position. 
This is in agreement with the fact that the more skew the bino- 
mial, the larger the n required to make the normal curve a good 
substitute for the binomial expansion. 

It is evident from this graph that as r? is fixed at increasingly 
large values, ^ and for the point binomials of different Is 
converge quite rapidly toward the normal position. The limit of 
equation (6), as 77 becomes indefinitely large, is equation (5), 
that is, the line giving /5j and for the Poisson exponential. 
This is to be expected, considering the conditions under which the 
point binomial approaches the Poisson exponential. For this lim- 
iting case the /3j/&> line for constant 77 crosses all the radiating 
p lines at 0 , 3 , except the line for p> = O , with which it coincides 
throughout Thus for all values of p , except O >/Sj and /3g for 
the point binomial agree with the corresponding moments of the 
normal curve, as n becomes indefinitely great. 

4. The relationship between /3j and ^<3? for constant values of 

In judging how adequate the size of a particular sample is, for 
a specified value of p > , we frequently make our estimate in terms, 
not of 77, but of the mean value, r?p. This is with the thought that 
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we are in approximately as good a position with a p of / and an 
77 of 50, for example, as with a p of 01 and an 77 of 500, since 
the expected number is the same in both cases. For instance, our 
knowledge of a penny from IQ tosses is about as complete as that 
of a dice from 30 tosses. Tu study tins question from the moments 
of the binomial we can determine the curve relating the /S3s for 
binomials of constant np , by replacing np by 777 m equations 
(1) and (2), and eliminating the remaining /pi and dr between 
these two equations. This gives the equation 

(7) 

- 4 777 {4? Try -iJs-Oj + (4? 

This is the equation of a hyperbola with asymptotes 

( 8 ) 

(9) 7$v<3 ' 

The substitution of any particular value of rr? in equation (7) 
will give the curve of /^ifor all binomials having this specified mean 
value. Its intersection with the radiating lines of constant p gives 

and for the point binomial having the specified p and 
mean value Figure 2 shows the two hyperbolas for which 777 =2 
and 10 respectively. 

Turning to the asymptotes of the hyperbola, it will be seen 
that only one of them varies with 777 Thus the various hyperbolas 
obtained by substituting different values of 777 m (7) will all be 
asymptotic to the same line (8), The centers of all the hyperbolas 
will therefore he on this line, which, it will be noted, is the Type 
III line The other asymptote is parallel to the Poisson exponential 
line and a comparison of its equation, (9), with equation (6) 
shows that it represents the same relationship between , 

and m, as that previously derived between /3 Z , /£, , and 77. This 
asymptote is therefore the particular line m the family of 77 lines 
for which r?* ? 77 or np, Since any point on the hyperbola is the 
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HO 2 THE RELATIONSHIP BETWEEN f3 } AND p g TOR POINT BINOMIALS HAVING 
CONSTANT VALUES OF np 

intersection of r? and /? lines such that ??/? has the constant value 
m , it follows that points farther and farther out on the hyperbola 
represent the intersection of lines having values of r? closer and 
closer to the mean, 77/?, and values of approaching the value 
If we consider this asymptote for hyperbolas of increasing values 
of rn we see that it approaches the Poisson exponential line, and 
in the limiting case, as m becomes indefinitely large, the hyperbola 
degenerates into the two lines which represent the or the Pois- 
son exponential series, and the Type III curve. These limits will 
be further discussed in a later paragraph. 
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All of the hyperbolas in the family represented by equation 
(7) are tangent to the line /3 2 *0, which is the value for binomials 
of p* J. From figure 2 it can be seen that the lines giving the 
/3 s for the other values of p are crossed twice by each hyperbola. 
This is due to the fact that each line represents the /3!s for two 
complementary values of p , and for such values of the same 
mean value would result fiom two different values of n , such that 

m - 77/0 - 77 

For example, the binomials f £ * 8j*° and f have 

values that lie on the same stiaight line and mean values that lie on 
the same hyperbola It is thus obvious why the hyperbola must be 
tangent to the line for p = J”, since m this case the two complemen- 
tary values of p are equal and theie can be only one value of n 
which will produce a given rr?. 

From the discussion of lines relating^ and /^ for constant 
n values, it is evident that of the two crossings of any p line, the 
one nearet the Gaussian position is for the point binomial with the 
larger r? and therefore for the smaller of the two complementary 
p values. Furthermore, through the point of intersection of two 
hyperbolas, only one r? line and one p line will pass, and the 
/3 s thus determined are for two binomials, one having the smaller 
mean and the smaller / o , and the other the larger mean and p , 
both having the same r ? , For example, the two hyperbolas given 
in figure 2, intersect at the point = % £666%/Q*3i The value 
of n for this position is Jg , and for p is J/<5 or yJ/<5. These 
values of and^ are therefore for the binomials ft/tf 
with the mean value A 3 , and with the mean value 

10 . 

From figure 2, it is seen that the hyperbolas extend into the 
area between the Poisson exponential line and the Type III line, 
Since, as stated above all ps between O and 1 fall in the area be- 
tween /3j, s( 3 and the Poisson exponential line, it raises the ques- 
tion as to the meaning of the hyperbola outside that area. In the 
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point binomial [ ^ FT-/&V] , there is nothing to force to he 

between O and J , and n to be positive except the conditions we 
impose for applications to probability problems. If we consider 
the general case, without these limitations, an analysis of equation 
(3) shows that the radiating lines giving /3 X and for fixed 
values of continue into the area between the Poisson exponential 
and the Type III lines, the values of /o in this area being either 
negative, or the complements of these negative values, that is, pos- 
itive values greater than J . This area includes all values of /v 
from O to - oo , and from / to * °o . Thus, lines giving 
and for all real values of /=> from - aa to t oa are 
included between the vertical and the Type III lines. Outside this 
area, the values of/? are imaginary. Turning to the values of r > , 
we can see from equation (6) that in the family of parallel lines 
giving^ and xg, for fixed values of r? the lines below the Poisson 
exponential all have negative values of n . As r> approaches - o a , 



FIG 3 SUBDIVISION OF THE fit . fir PLANE FOR BINOMIALS CLASSIFIED ACCORDING TO THE 
VALUES OF P AND n 
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this line approaches the Poisson exponential line from below. Fig- 
ure 3 shows the subdivisions of the plane for the various 

cases. 

It follows from this discussion that the values of the hypeibola 
in the aiea between the Poisson exponential and Type III lines 
give /5s for point binomials with negative and since /?p is a 
fixed positive value for any hyperbola, negative values of /o , rang- 
ing from O to - oa . These results are in harmony with the case 
mentioned above, where the hyperbola degenerates into two straight 
lines (the Poisson exponential line and the Type III line) as the 
mean becomes indefinitely large, for r?p approaches cx> when 
either n or p approaches oo Thus, the Poisson exponential line 
is the limit when n approaches c*? , and p-J , and the Type III 
line when p approaches - os , and n is negative. 

5 The relationship between ,<5j and for constant values of 

pp 9 . . 

A further point of interest is the scatter of the pfsior point 
binomials of varying pA- but constant standard deviations. In 
equations (1) and (2)> if we let 
g we have the equation 

( 10 ) ' 


ppg - cr^ 9 and eliminate p and 


6<t-J 

cr * 


= O 


This, like the lines giving /5j and/5, for constant values of r? , is 
a family of parallel straight lines, but where the r? lines were par- 
allel to the Poisson exponential line, this group is parallel to the 
Type III line. These lines intersect the radiating lines giving the 
/3s for constant values of p , and as cr* increases, the points of 
intersection of these lines appioach the /5j and/51, for the normal 
curve. As approaches c*? , the line given by (10) approaches 
the Type III line, and in the limiting case, crosses all the p lines 
at the Gaussian position. 
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6. Summary . 

Certain relationships between the third and fourth moments 
of the point binomial in terms of andpZ, have been discussed 
and the following results have been brought out : 

A. For fixed values of p ,/jf, and/2, are linearly related, 

forming a family of radiating lines, all passing through the posi- 
tion of the /Ss for the normal curve. Each of the lines represents 
the/ds for point binomials having a fixed value p or its comple- 
ment, J-/Q . The lines for values of p between O and 3 are 
included between the vertical /dj - O , which is the line for p«.J 
and the Poisson exponential line, - 3* O, which is the line 

for p^O or 3 . The lines for negative values of p or positive 
values greater than 1 , fall between the Poisson exponential line and 
the Type III line, - <5 -O* For the rest of the plane; 

the values of p are imaginary. 

B, Although it has been shown by Pearson that in certain 
slope properties the skew point binomial resembles the Type III 
curve, none of the binomials which we interpret as probability func- 
tions, that is. those haying p between C? and 3 and 77 positive, 
has /dy and p approaching those of the Type III curve, except 
for the special case where the Type III curve becomes identical with 
the normal curve. 

C. For fixed values of r? , /d^ and /d^ determine a series 
of straight lines parallel to the Poisson exponential line. For posi- 
tive values of rt , these lines are above the Poisson exponential 
line, and for negative values below this line. Intersections of these 
lines with the radiating p lines determine^ and ^ for the point 
binomial of specified p and r? . As 77 is held constant at in- 
creasingly great values, the points of intersection are closer and 
closer to the position of the /dS for the normal curve, and in the 
limiting case the line oi/ds for constant n intersects at the normal 
position all of the family of p lines except the line for O , 
with which it coincides. 

D, pd; and /d? for point binomials of varying and r? , 
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but constant mean values, lie on a family of hyperbolas, a particular 
hyperbola being determined by a specified mean. One of the 
asymptotes of all these hyperbolas is the Type III line, and the 
other is a line parallel to the Poisson exponential line, at a distance 
from it, depending on the value of the mean. The limit of this 
asymptote as the mean approaches ^ is the Poisson exponential 
line. These hyperbolas are tangent to the line * O , (the line 

for cf) and cross the other /d lines twice, the intersection of 
the hyperbola and any /d line or any r? line determining the^^ 
for the point binomial whose /d and n are defined by the intersec- 
tion. 

E. For varying /d and r ? , but fixed n and/<J lie 

on a family of straight lines parallel to the Type III line, one line 
in the group being determined by a particular The limit of 

these lines as ry/Dg. is held constant at increasingly large values is 
the Type III line. 





EDITORIAL 


NOTE ON THE COMPUTATION AND MODIFICA- 
TION OF MOMENTS 

For tlit pm pose of this note we shall deviate fiom the usual 
practice in the calculus of finite differences and define 

Au* - - M ' a x' 

where M is a constant. It follows that this generalized A and 
the symbol E aic connected b) the opeiator relation 


A = (E- M} so that 
A n *(E’M) > and therefore 


n 


(1 )Au^u 


x+n 




W/Z**' 




If the n-th unmodified moments about an arbitrary origin, and 
about the arithmetic mean, be designated by t/ and \> n , respective- 
lj , the usual relation may be written 



where y equals the distance of the mean from the provisional 
mean, From (1) and (2) it follows that 
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that is, the n-th moment about the mean is simply equal to the nJh 
leading difference of \} 0 . Since computing machines are ideally 
adapted to computations of the type (A-B Cj, formula (3) is very 
effective. 

As an illustiation, let us compute the hist seven moments about 
the mean for the distribution of weights of 7749 adult males born 
in the British Isles. (See Yule’s “Theory of Statistics”, p. 95.) 
The provisional mean is taken in the 150-lb. class, and the class 
interval of ten pounds is taken as the unit of x . 


Table 1 


n 



AV n 




0 

7749 

1 000000 

000000 

4.55356 

692736 

91.3249 

1 

1726 

222738 

4.55356 

7.94161 

92.8679 

456 762 

2 

35670 

4.60317 

8 95586 

94.6368 

477.447 

4471.10 

3 

77344 

9 98116 

96 6316 

498.526 

4577.45 


4 

766026 i 

98.8548 

520.050 

4688.49 




5 

4200496 I 
38164290 1 

S42.069 

4925.06 

4804.32 





6 



n 

A*Vn 







1 

436.420 

4272.15 






2 

4369.36 



It is very important that the provisional mean be so chosen 
that M= V f is less than unity — otherwise particular attention must 
be paid to the number of digits which are significant m the values 
of the various differences. 

Let us now discuss the modification of moments. 

Designating the general modified or corrected moment by ju n » 
Sheppard’s formula for continuous variates may be written 


(4) 


J n-4 


" 7 ) <31 ,/ 

6/ jj44 v *-6+ 


so that for moments about the mean, 



EDITORIAL 


231 


( - _ i 

' *2 "X? 

M3 = \? 

^ ^ = ^4 ~j ■ / ',db 

-^f ^s~6 ^3 

- -/ £ -> Z. i H 

^ Md ~ ^6 ~ 4 ^ +/6 %? "/ S ^ 


In many distributions discrete variates are grouped into classes, 
each class containing A' different values of the variable. The for : 
mula, corresponding to Sheppard's, for grouped-discrete distribu- 
tions may be obtained by employing the calculus of finite differ- 
ences, and was given without proof in an Editorial on page 111, 
Vol 1, No. 1 of the Annals as follows, 


M.-W 


V-LS- 

?/ 1Z 


140 


JL J 




1344 


T* *k 


^r>-6 * ' 


Obviously the limit of (6) as k approaches infinity is (4), So 
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In both (5) and (7) above it is to be understood that the class 
interval, A , is chosen as the unit of ^ . 

The modified moments,/?^, of the following table are ob- 
tained by applying formulae (5) to the values of \? n , which were 
obtained as the leading differences of table 1. 


Table 2 


n 




cm 

0 

1 00000 

1.00000 

1.00000 

1.000000 

l 

1 00000 

.00000 

2.114292 

,000000 

2 

4.55356 

4.47023 

4 47023 

1 000000 

3 

6.92736 

6 92736 

9 45137 

.732948 

4 

91.3249 

89.0773 

19.9830 

4 45765 

5 

436.420 

430 647 

42.2499 

10.1929 

6 

4272 15 

4159.96 

89.3286 

46.5692 


Tables 3 and 4 shed an interesting light on the subject of mod- 
ification, and are obtained from the results of formulae (5) and 
(7). If the class interval be denoted by A , and the unmodified 
values of the standard deviation and skewness by cf J and , re- 
spectively, that is 


(8) 


it follows that 




< 


^ / FT 


v. 


3 X 




( 9 ) 


<7y = <?; CO 


= 


<3_ 
CO 3 


CO=^ 


U- 


Lm 

12 1 


and 

where 


'A 
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As mentioned before, the case of continuous variates is the special 
case for which A** 

To illustrate* for our distribution taken fiom 3 ule, 10 Ibs. } 
and by (8) 


a' - 10 J 4 553 56 /fa C 21. 3391 /fa 


, 6 92736 

= fc 13391? 3 


= .712919 


9p = 463623 
u v 

rt 

From table 3, Am 4?k=oo we have co - 99&7 and con- 
i'' 

sequently cr y * 99&7 r 21. 3391 !b$ ) - 21 14 /bs. % agreeing 
with the more exact value deduced from table 1 , i.e. 21 14292 lbs. 

From table 4, 2P a 47, k- oo> wc have cJ - 1, <328 and there- 
fore ^ - 1928? 71591 9 J = 7^25? , again agreeing with 

the more accurate \alue of table 2, 

By either interpolation of tables 3 and 4, oi by direct compu- 
tation of co and c o 3 , greater accuracy may be obtained 

For an illustration of grouped -disci etc variates we may refer 
to pages 32 and 37 of Vol. 1, No. 1 of the Annals For the so-called 

D(4.1) of Table IX, A = 4. a fa 508? fa = 096. Hence 
= 79 > and since k= -^the modified or adjusted values are 


4,. S 08? fa 9353) --4 963 

<*=. 096 ( 1 . 078 )- 103 . 

o 


It should be observed that the factois cu and co ^are indepen- 
dent of the number of variates, A/ , and are just as properly ap- 
plied even if the frequency distribution method for computing the 
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standard deviation, skewness, etc is not employed. Thus, if I com- 
pute the standard deviation for the weights of ten individuals and 
those variates are recorded to the nearest pound, then the resulting 

"V N 

theoretically, if not practically, should be modified. If it developed 
that crj = 20 lbs M then for weights to the nearest pound A- 1 , 
and 


-/'-J'Ji- -999896 


If. on the other hand, the weights had been taken to the nearest 
half pound, or to the nearest tenth pound, the corresponding values 
for oj would he 999974 and 999999 respectively Only if the 
variates he discrete and k* 1 , or if the variates be continuous 
and be measured with absolute accuracy (which is impossible from 
a practical point of view) so that A = O , can co- I , and modifica- 
tion be ignored. 

It should be cleaily understood, however, that Sheppard’s cor- 
rections are merely expectations. We have no assurance that the 
use of one of these corrections in any single instance will increase 
the accuracy of that determination — it is quite likely that in any 
isolated case the modification will introduce a still greater error 
into the calculation As pointed out m pages 36-3S of Vol. 1 of the 
Annals, and clearly revealed m the included table IX, modifying 
eliminates only the systematic errors, and ignores the accidental 
errors which may be numerically greater and of opposite algebraic 
sign than the correction itselt, 

Lastly, one must remember that the mathematical theory under- 
lying Sheppard’s Corrections assumes that both the frequency 
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function and a sufficient number of its derivatives vanish at the 
limits of the distribution, Consequently the case of J-shaped or 
U-shaped distributions, or of data not actually classified into such 
distributions, is not covered by formulae (5) or (7). In any event, 
it is evident that the practical necessity for modification in all cases 
depends upon the ratio of the limits of accuracy of the measure* 
ments employed in the computations to the unmodified standard 
deviation, If we throw accurately determined variates into fre- 
quency distributions with large class intervals, 'A and thus sim- 
plify certain computations, then we must realize that the introduc- 
tion of a systematic error is the penalty paid for such procedure, 
that the greater the value 4 , the greater the penalty, and that 
the practical necessity for modification rests entirely upon the ac- 
curacy demanded of the final results, 

The chief value of tables 3 and 4 is that an inspection of these 
tables gives a rough idea of the value of modification so far as the 
standard deviation and the skewness are concerned. 





236 


COMPUTATION AND MODIFICATION OF MOMENTS 


















F DITORI 1L 


237 
























2.18 


COMPV I A'i ION 1 YD MODIFICATION OF MOMENTS 


Table 4 


r 3 . [/. L±*/J- 
' L * ’ < 














EDITORIAL 


239 












SIX DOLLARS PER ANNUM 


NOVEMBER 


1933 


PUBLISHED QUARTERLY BY 
AMERICAN STATISTICAL ASSOCIATION 


Publication 0 /Tice— Edwards Brothers, Inc., Ann Arbor, Michigan 
Btuinfss Office— 530 Commerce Bldg,, New York Univ., New York, N. Y. 

Entered as second class matter at (he Postoffice at Attn Arbor, Mich., 
under the Act of March 3rd, 1879. 


STATEMENT OF OWNERSHIP 
UNDER ACT OF CONGRESS OF AUGUST 24, 1912 

Publ irfter— American Statistical Association, New York City, New York. 

B4lkr—Pl. C, Carver, University of Michigan, 

Mdiiaglug Editor— Rattan g, Seklion, Ann Arbor, Michigan, 

fytsfaeu Manager— l W, Edwards, Ann Arbor, Michigan. 

Owner— American Statistical Association, 530 Commerce Bldg, New York 
City. 



the extended probability theory for the 

CONTINUOUS VARIABLE WITH PARTICULAR 
APPLICATION TO THE LINEAR 
DISTRIBUTION 

By 

H. P. Lawther, Jr. 

The engineering worker is often confronted with the necessity 
of utilizing a group of quantities concerning whose numerical val- 
ues it is known only that they lie between definite upper and lower 
limits. If a number n of specimens is selected from such a group 
and the sum of the n values taken, intuition rules that there is 
negligible probability that this sum will be as great as n times the 
upper limit or as small as n times the lower limit, and that the 
most probable value must be intermediate between these two ex- 
tremes. Some assurance is desired regarding the practical limits 
within which such a sum may be expected to fall. While the dis- 
tribution oi the individual values within their limits may be un- 
known directly, yet workable inferences frequently may be made 
from the nature of the quantities. For example, in many manu- 
facturing operations it is economical to turn out items (such as 
bearing balls, paper condensers, or spacing washers) in large quan- 
tities with rather coarse precision. By means of gauges set to 
limits narrow as compared with the total spread, the product is 
then selected into bins, and in the operation of assembly a com- 
pleted article utilizes the material from a single bin. The contents 
of any such bin clearly may be expected to follow a linear distri- 
bution very closely, and if the relative proportions of the product 
finding their ways into this bin and its immediate neighbors can 
be learned, the distribution may be specified with practical accu- 
racy, The linear distribution is thus fundamental to a large clars 
of problems. 

On several occasions the writer's speculations have led to pro 
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**" , W t«# ri ” ,t l° n ’ 8nt * t * le re f erenc e literature 
S " f, "» **«n 4 first jk s P ec ' a * 0336 rectangu- 

v *.«t 3 ty ' ®. ave been formulated by Laplace 1 

' letz ' Irwin, 1 Hall, 4 and Craig, 0 in 
**•' t a, ' a V se s applicable to the study of the 
*, fdls* ( . mm a some what different viewpoint. In 
'*%*►*, mH C oglcal Processes of specialists in this 
^ 1|( - VCn lQ P ut forth considerable independent 
*4 >H» ,4^ u!|? 81 * Sat,sfact °ry understanding. As the 
*"° f flr an * l f of approach was developed, 
terminology familiar to one whose 
_ hf . h ™ " ave been limited to that commonly 

* r, ff iiw ^nng course, and should be readilv 
W * H W»WiiF U> 4 tyjtld I i -4 

c iieca of workers. Encouragement was 

1 the case of the generalized linear distribu- 

» uiwl w >ch appears to be new. In the application to 

ttete^ry to carry out certain tedious compu- 

ting values and curves. The results of this 

*4* i**ld with the thought that they may stimulate the 

^ ft lft w of considerable application to 

ii «l hr that when a selection, or the sum of n 
C) ^ there is meant the dimension of that selec- 
dimensions of the n selections. Following 
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$ H- .* / 
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the usual notation, the symbol f n fo) will be defined to be such that 
b 

the integral J f n (*0 dx is equal to the piobability that the sum 
a 

of n selections lies between the values a and b . Neglecting 
higher orders of infinitesimals, the probability that the sum of n 
selections lies between ^ and x+Ax would then be equal to the 
product f n (x)- Ax The sum of n selections clearly is the sum of 
n - 1 selections plus the value of an additional selection The prob- 
ability that the sum of n selections lies within the interval # to 
x + Ax must then be equal to the summation of the probabilities 
associated with all possible pairs of values for the sum of the first 
n-1 selections and for the last selection, respectively, that can 
yield a final sum lying between X and x+Ax The values x-m A?c 
and m Ax , where m is an integer, are such a pair, and the totality 
of these pairs is obtained by extending m to all possible values. 
Recalling that the probability of the simultaneous occurrence of 
two independent events is equal to the product of the probabilities 
associated with their individual occurrences, there may be written 
in the conventional symbols 

m - °o 

f n (x>)-Ax =£, t i(^-m A%)-Ax, f,(m Ax,) Ax, 
m*-<» n ~ 

Setting m Ax = A , and passing to the limiting form, there is 
obtained 


f n C*)*/ f n _j (x-A) f, (A) dA , 

-00 

as the general formula for determining all subsequent fi from 
f, (x) . The form of the function f, (x) is, of course, determined 
for any particular case from the best available physical data. Ihe 
expression for f n &)is then obtained by n-1 successive applications 
of the operation of integration indicated above. For the sake of 
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subsequent brevity, the operator P will be defined to be such that 

CO 

Pcp(») a (A)- dA 

- 0 $ 

Usin^ this notation it would be written 

f n u). P n4 f t (x). 

For the general linear distribution is given as follows: 

f j (x) = 0 , for - oo < x < 0 

f (X). 0 , for Clc^<co, 

the equation of a straight line for 0 = x = a , subject to 
the conditions: (1) the area under the line from x, = 0 to x~a is 
unity, (2) no ordinate is negative for any value of x in this in- 
terval By imposing these conditions upon the general equation to 
a straight line there is obtained 

f, fx)=^ [l + k- — (a^)] , for Oix~a 


where the parameter K is restricted to the values -4= KM . With 
f/tf) so defined it can be inferred immediately that f n fr)rnust be 
identically zero for all negative values of X , will have some positive 
finite value everywhere m the interval 0 < X < n a , and must be 
identically zero for all values of X greater than na . Also, since 
f\| (x) is discontinuous for x; and for x — a the application 
of the operator P must be effected through proper choice of limits 
of integration. In this connection three possible cases arise: 

Case J ; Where x , the sum of n selections, lies in the interval 

0 £ x £ a it could have resulted only from a value for the sum 

of ni selections lying in the interval x to 0 , coupled with a suitable 
value for the n-thselection lying in the interval 0 to X . For this 
case the operator P will be distinguished as follows : 




If. I‘ f 111 / ///• A\,./A* 


245 


Case 2: \\ heic x , the sum of n selections, lies m the inter- 
val a £ it could have resulted only from a value for the 

sum of n-1 selections lying in the interval x to x-q , coupled 
with a suitable value for the n-th selection lying in the inteival 
0 to a . For tins case the operator P will be distinguished as 

follows : a 

+ „W-P i (A)-dA , tor a = X = (n-l)a . 

0 

Case 3: Where X , the sum of n selections, lies in the interval 
(n-l)a = x ^ na it could have resulted only from a value for the sum 

of n-1 selections lying, in the interval (n-i)a to x-a, coupled with 
a suitable value for the n-th selection lying in the interval x-(n-i) a 
to a. For thus case the operator ~P will be distinguished as follows : 
cl r a 

f n (*0= P ‘ f n-i ^ ^ f* (% ) dA for , (Vvl )a $ x ^ n a . 

The procedure now is analogous to that employed in establish- 
ing the binomial theorem The first few f s are obtained by hand- 
power methods, until the sequences can be discerned and the ex- 
pression for f n ( v) can be inferred The expression for f n (x) is then 
established, first by applying to fe) the operator P and showing 
that this yields an expression for f n+1 M wholly consistent with that 
for f n (x,) when n+1 is substituted forn, and finally by showing 
that it degenerates into fj (x) when n is taken as 1 . 

The preliminaty steps, while \ery necessary, are quite tedious, 
and there would be no value in repeating them here. Suffice it to 
state that by such means it can be inferred that f n M is of the form 
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where it is understood that each term including a bracket member 
of the form [na - ba~x] is to be assigned the value zero for values 
of % which render this negative. The use of brackets [* *] dis- 

A 

tinguishes the operand in each term. The symbol denotes the 

operation of integration with respect to between the upper limit 
X and that lower limit for which the integrand vanishes. Thus 


1 

P 


[na-ba-xi] 


tn 


1 

m+i 


[ na- ba - x] ' 


. Where p occurs 


with a negative exponent it signifies the inverse operation of differ- 
entiation with respect to x- . The symbol means ~ - g yj and 

is one of the familiar binomial coefficients. 

Preparatory to establishing the validity of the inferred expres- 
sion for f n (x), it is convenient to assemble certain working ma- 
terial. First <p n (x) will be defined as follows : 

<Pn )rna-a-^l 

|l-K*|-p [na- vii-x.l 

where the symbols all have the same meaning as before, but here 
there is no special understanding regarding the bracket members 
of the form [na-ba-x] and they are to exist for all values of a; . 
Especial note should be made of the inclusion of a final term in 
. Otherwise the expression is identical in appearance with 
that for f n ^)for the interval O^xia. Next, the typical opera- 
tion 
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is readily evaluated and yields 

Now it can be written immediately that J 

w 4> n W-jsa(-i ) {(l«*^r‘[na.ci-Ji]-(ltK.» ) n (i-K*5-p)["«-> i ] 


Collecting terms, this becomes 





[-*]}■ 


and this is seen to correspond exactly with the expression for $ (x) 
if n+1 is substituted for n. Consequently, (p n (x) must be the 
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result of u~i successive applications ot the operator to a 

certain <£) (x) given by 

which is seen to he identically zero {or all values of x . Now the 
application of the curator Q P to zero yields zero. Therefore 
(p n C«)must be identically zero for all values of x, . Finally, it is 
convenient to evaluate the typical operation 

Jp (a-A)].d/\ . 

This yields 

The expression fot f n (x) now may be established in straight-for- 
ward fashion. 

In the interval Oi x $ a *he expression for f n (x) will con- 
clude with the term involving * As has been shown before, 

the expression for (x) should then be given by 

f n+i f n ior 

This operation may be evaluated readily, and will yield a result 
that is correct. The form of the result, however, is such that it does 
not display the desired correspondence with the expression for 

f n (x) . The expression for (p n (*) is introduced here to advan- 
tage Let it be written 

W*K Pf n 


for d=x = a. 
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Since the last term is identically zero its introduction is peimissi- 
ble. Remembering that (p (x ) consists of all the terms of f (*.) 
plus a final term in |_-^J , a rearrangement may be made giving 

f n^ + o p ^fp) r,2 W i ' K+ 5p)M. for 0=^=a. 

Using the operations that have been evaluated above, there is ob- 
tained immediately 

ff 1 +K+ |p) CnQ+o-^-Cl + K^) n fl-K+^)[na-»J 


i ) ( 1+K+ % ) n ( 1 ' K+ % ) £ na -*J - — 
+ (l)( i +K+ S) ri'K + f^) Z rna-a-x] 


+(-l) ’(n n -l)(l+K^) (l-K+|^y [Za-x] 

-(-O n ‘fe)(i + ^|Xl-K + ^) n Ca-,]) 

B)" 1 [W n (l+K^)(l-K+^ )V*]},for 0«» * a. 


Collecting terms, this becomes 
m+1-2 


LMiy-n^rjViM 


W^inKl pj l 1 


ap 


n+l'i 

1 




ap i 

2K^/, . Ztf^r 




ap' 

for 0$ *■ =a 
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and this is seen to be wholly consistent with the formula for f (*) 
for the same interval with the substitution of n + 1 for ri . 

Now for some interval between x = a and x-n a, say (na- ba-a) 
& X £ ( na-ba) where b is an integer having any value from 
zero to n- Z , the expression for f (x) will conclude with the 
term involving [na-ba -xj , For the interval immediately preced- 
ing, namely [na-ba ~ Za)ix> £ (Yia-ba-a), the expression will 
include the additional term involving [no - ba-a-x]. Therefore in 

a 

evaluating the operation P* f n 0<0 , which as has been shown be- 
fore should yield the expression for the integration of all 

but the [na-ba* a -x] term will be carried over the complete range 
of P from 0 to a . The term involving [na- ba-a-#] will not 
enter into until P reaches the value (x- na+ ba +a), and 

so the integration of it will be between the limits (#-na+ba+a)and 
a . Using the operations that have been evaluated previously there 
is obtained immediately 



for (na - ba-a) a x£ (na-ba). 
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Collecting terms, this becomes 




a 


n +1 


/ i\n+Mf 

VpJ 


{<> K+^y"‘[(n+l)a-*] 

(TX UK+ r P ) f 1 ' K+ S)I< n+1Ja - a ’*]‘- 


ml . 


■^rOKr fl-K^fWDa-r^a.] 

for (na-ba-a)^l(ha-ba). 

and this is seen to be wholly consistent with the expression for 
f n (x) for the same interval with the substitution of n+i for n . 
Finally, for the interval na a) the expression for 

(x) is identically zero. For the interval immediately preceding, 
namely (na-a.) ^x = na , it consists of the single term involving 
[na-AsJ. Therefore 

f n + l^V?a f n^V?a ^ (~p) ( ±+H ^p ) n tna ' x] 


d S A \ n+i-i / 2 K \ r , .V 

* — ^ (d) — ) [(n^i)a-A;] for naS;tfi(h4-l)a 

q i tJ ap 

and this is seen to be wholly consistent with the expression for 
f n (*) for the corresponding interval. 

Setting n*i in the general expression for f n (x) for the in- 
terval O * x = a there is obtained 

Carrying out the indicated operations, this becomes 

f i (*) « ~ (a-x )3, for 0*x*a 


and this is the f, (x) chosen at the start. 

In the derivation of f n+ { (x) from f n (x) it was assumed that 

the two forms of the operator P , namely P and a P , are com- 

mutative with the operator i when applied to an operand of the 
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form [J3->J It is veij «is> to show that^P.p [fi-xj yields a result 
identical with that of q P [B . *,] and that a p.j| [fi-x] 
vicMs a icsuli idcMitual with that m - p I 1 , and the 

r x', & J 

space will not be taken here *o ip\e this demon *4 ration The ior 
inula fot (x.) may thus he regarded as fiimly established. 

It has been shown that the complete expression (%) is iden- 
tically zeio for all values of x Theiefore, if in the interval 
(na-ba-a) ^r(nG-ba) the desired function f (%) can be 
represented by the paitial expression 



it follows that it tna\ equally well be ^presented in the same inter- 
val by the negative of the remainder of the complete expression, or 


'nWvff)" [«*E )6 ta-ba-a-AjJ 

(-I) n -‘(^)(UK* |K W] 


Kenietnbering that ^ j^fn - 2 ) * etc , this last may be 

rewritten 


Wfn'1 


P 

'n 




,- k+ 2KP- 2 / 

ZJV op, 
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where again it is understood that each term including a bracket ' 
member of the form [*-ba] is to be assigned the value zero for 
values of ^ which render this negative. The having of these two 
forms of expression for f (z) is very valuable in computation 
work, since it limits the number of terms that have to be handled 
m2, . 

to a t most. 

Setting K equal to zero gives the special case of the rectangular 
distribution, and the expressions for f n (x) reduce to the forms 

f n (X) 't4p) |[ na - >s 3-'(i)[ ria -^3 + (z)t na - 2a '^ ] » 

and 

Carrying out the indicated operations, these become 

and 

This last expression is the one usually found in the literature, and 
it was originally developed by Laplace as the limiting form of an 
urn problem. 

Setting K equal to plus one or to minus one gives either of 
the extreme cases of the “right triangular distribution 0 For K 
equal to plus one the expressions for f n 60 reduce to 



and 
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The function f n fc) normally has no direct practical application, 
but it is of interest to see its trend with increasing values of r \ . 
There are shown on Figure 1 several members of the family of 
curves originating with the rectangular distribution, and on Figure 
2 corresponding members of the family originating with the right 
triangular distribution. In both figures the interval a has been 
taken as unity, and there have actually been plotted the curves 
y* 0'f n (r\'X) . This change in variable places all curves to a 
common base, and at the same time preserves the property of the 
total area under each being unity. 

For the sum of n selections the practical worker wants to know 
the minimum value x 1 ; or the maximum value X 11 ; or, most often 
of all, the shortest interval x' to a; 1 ' associated with a certain 
probability value. The probability that the sum of n selections 
will be less than x l is given by 

F n (*')•/” »(*)-<&• 
o 

and the probability that the sum will exceed x" is given by 


r na 

F n c *>/ f n OO 

. , r*' i 

Noting that / DtJ-ba] cU«/ fc-bd.cU-i-C# 1 - baj and that 


« no na-ba . 1 

/ [na-ba-xjdx^/ [na-ba-x]dtf*-~*[na- ba-xj, since the 
J x" -V p 


bracket members are assigned the value zero for values of x which 
render them negative, there may be written immediately 

and 

~(i)( i+K+ ^) n T 1 '^ Jf naW >GX 1+K+ «^ ) n 
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FIGURE -"z 

GRAPHS OF CURVES yn[,(tn) FOR RIGHT TRIANGULAR DISTRIBUTION 






ii p l m then, jr 




Foi x - na or for 0 these last expressions tespectively should 
equate to unity, the total area under any probability curve. It is 
simple to verify that they do so, and the demonstration will be 
given for one of them For the interval O^'^a the expression 
for FJ-jOc") concludes with the term containing Let there 

be added and subtracted a term involving to give 


n-1 
n-JL 


( 1+K+ j^) n [ na '*“3 


pj 

A-Ta-** % ) n r-«"j]-~n ) n £0. 

From inspection it is seen that this may be written 


FFV' 1 (-0 


ZK 

ap 



where the expression introduced previously* Upon car- 
rying out the operations it is found that - ~ (f t ( x>") - 1 , and 

It follows therefore that 


iSM-i- 


a 


n-l 


1-K + — ) .for 0£x = a 
ap/ J 


and it is apparent now that for %" -O this expression is equal to 
unity. Thus it is seen that the function F n (x) also possesses an 
end-for-end s>mmetry similar to that of f n (x) The complete 
expression corresponding to F n C>^) equal to unity instead of 
zero, however, and where the desired function is represented by a 
partial expression it can also be equally well represented by one 
minus the remaining terms of the complete expression, 

Refenmg to Figuie 3 it is ^een that the sum of aiea A 
plus aiea lb or F~ n (x‘)+ F n ( x"), gives the probabiht} that 



iMguie 3 
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the sum of n selections lies outside the interval X' to X H , In 
an actual problem usually either the sum of A and B would be 
assigned and it would be desired to find the interval x 1 to x M 
associated with this expectancy, or the interval x' to x M would 
be assigned and it would be desired to find the expectancy asso- 
ciated with these limits. With f n (*0 of the character shown in 
Figure 3 and with the sum alone of A and B fixed, any number of 
pairs of values are possible for x 1 and x” . It is also clear that 
the length of the interval x"-x‘ will depend upon the relative mag- 
nitudes of A and B. There are two special cases, however, which 
cover all normal demands, It is seen at a glance that the interval 

x“ - X* will be shortest for a given A plus B when f n (x") = *f n (x 1 ). 

Purely from the standpoint of deviations, this shortest interval 
represents the optimum results of which the group is capable. 
Where the absolute magnitude is of primary concern it might be 
speci fied that x 1 * n ol - x ", 

The function which is of final interest, then, is represented by 
the sum F n (V)-t- (x") , subject either to the restriction that 
f n (x') = f n (V‘) or to the restriction that X 1 * na- . For the 
special case of the rectangular distribution f n (*0 is symmetrical 
about the line x = and the two restrictions imply the same thing. 
The symmetry of the rectangular distribution permits giving for- 
mal expression to the sum F n (x , )+fj 1 (x‘9as a function of (x -X 1 ) 
when x 1 ^ na-x" . Under this condition the sum becomes equal 
to ZF n (V) and x 1 may be written as " and there is 

obtained 

f>,M + FnC*")* 2F n - ? a ~X X , for^'-n a-x* 
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For linear distributions other than the rectangular this simplicity 
of expression is not possible. 

On Figures 4 and 5 are shown graphs of the curves y = (h x) 

for several values of n for the rectangular and for the right trian- 
gular distribution respectively. Here again the interval a has 
been taken as unity and change in variable has been made to place 

all curves to a common base. The values of F n ( n x)ma.y be read 

directly from the same curves, since ( n • x") = 1 - F n ( n %') for 

corresponding values of X' and x" . Finally, on Figures 6 and 7 

are shown curves for the sum (x"j\ plotted as a func- 

tion of f ), subject to the restriction that f n (*>')- f n (x") 9 for 

several values of n for the rectangular and for the right triangu- 
lar distribution respectively. The values for Figure 6 were com- 
puted directly from the formula given in the paragraph above. For 
Figure 7, however, the values were derived graphically from Fig- 
ures 2 and 5, 

Figures 6 and 7 are applicable immediately to practical prob- 
lems. As a simple example, suppose there is at hand a group whose 
individuals are known to lie within the limits of D and X)+Cl and 
to follow a right triangular distribution with the larger probability 
associated with the larger limit, and it is desired to know for the 
sum of eight selections what limits may be expected to be associated 
with a probability value of 0.01. Referring to Figure 7 it is seen 
that the curve for n = 0 reaches an ordinate value of 0.01 at an 
abscissa value of approximately 0.45. Referring now to Figure 2, 
the distance 0.45 is fitted in between the two legs of the curve 

y I y it 

ior n= Q , and values for -q and ~q of 0.43 and 0.88 are found. 

Consequently it can be concluded that for sums of eight selections 
from this group the probability is 0 01 that the values will lie out- 
side the interval 6D+ 3.44a, to <9D + ~J 04a . 
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FIGURC-% 

ORAPH3 or curves y » r^c-n x) ran rectafkjuuar distribution 



FIGURE -5 

GRAPHS OF CURVES y FORNlBHT TRIANOUl. AR CuSTRlBUlT ION 
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owaphs Of curves of [r»(*r) • f n (V^ v». [Vj|f] row sect anojjlar Disrpusunov 
row CONDITION THAT {* (l. ) >f*( V) 



rwunc -*r 

IJAANH3 or CURVES Of [r* C* J * f« {*")] v. £-\^} FOR BiCHT TRIANGULAR DISTRIBUTION 
PO« condition That ■ {,(**) 
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While this study has been concerned primarily with the linear 
distribution, it is obvious that the results may find occasional wider 

application. The curves for f z (x) , \(x) , etc., are of a charac- 
ter suggestive of distributions that might occur not infrequently 
in engineering and physics. If any one of them, say f fe ) , is 
found to fit the group at hand with practical accuracy, then the 
sequence f n (*0 , £ flO* etc * clearly will give the distri- 

bution curves associated with the sums of one, two, three, etc., se- 
lections from this group. 



ON THE ELIMINATION OF SYSTEMATIC ERRORS 
DUE TO GROUPING 


By 

John R. Abernethy 

In the calculation of the moments of a frequency distribution it 
is often desirable, or even necessary, to consider not the distribution 
itself but another derived from it by certain groupings. As a first 
approximation to the moments of the original distribution we take 
the corresponding moment of the grouped distribution. But this first 
approximation is not satisfactory, and it is necessary to obtain some 
method for the elimination of part of the error committed in replac- 
ing the moments of the original distribution by the corresponding 
moments of the grouped distribution. 

This problem was first discussed by W. F. Sheppard in a paper : 
On the Calculation of the most Probable Value of Frequency-Con- 
stants, for Data arranged according to Equidistant Divisions of a 
Scale } If we denote the n-th moment of the original distribution 
by /x n and the n-th moment of the grouped distribution by y , 
we will have Sheppard's corrections in the form : 

r-z = "iz • 

M3 aV 3> 

^4 = V 4 ‘I ^140 ’ 

Ms bx/ 5 * 6 ^3 > etc- 

As pointed out by Karl Pearson 1 2 the hypotheses under which 
these formulae have been obtained are: (a) that Taylor's theorem 

1 Proceedings London Mathematical Society, Vol. 29, p. 353-380. 

2 On an elementary proof of Sheppard’s formulae for correcting raw 
moments and othtr allied points, editorial in Biometnca, Vol. 3, p. 308-312. 
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mav 1 * applied to the frequency function throughout the iange ; {h) 
is finite and continuous throughout the range , (c) f(x) 

and its derivatives vanish at the limits of the range These hypothe- 
ses are not always satisfied by the frequency functions with which 
the statistician has to work , and as it is impossible to tell before cal- 
culating the moments of a distribution whether the cot responding 
theoretical frequency function satisfies these conditions, it is desir- 
able to study the pioblcm from another standpoint 

A comparison of the title of Sheppard’s paper and the paper 
itself suggests the question, in what sense do Sheppard’s foimulae 
give the most probable value of the moments of a distribution p A 
partial answer is given by B. L. Shook in the Synopti* of Elemcntarv 
Mathematical Statistics d Miss Shook piesents 4 the formulae 

>VX-°» ) JL z = V z'h( i -~ m*)> and for a discrete dis- 

tribution with m values of the variable giouped in each class interval 
and shows that for a particular distribution these formulae serve 
to eliminate the systematic enors from M t ja z , and . Two 
problems are suggested by the synopsis : the derivation of formulae 
for the class of discrete distributions, as these three formulae are 
stated without proof the proof that this larger set of formulae and 
those of Sheppard do serve under all conditions to eliminate the 
systematic error due to grouping, subject only to the existence of 
the moments involved. When we have solved these two problems, 
we shall be m position to understand the true nature of the approxi- 

3 The Annals of Mathematical Statistics, Vol 1 , p 34-40 

4 These formulae are only special cases of a more general formula stated 
by H. C. Carver m an editorial Annals of Mathematical Statistics, 
Vol. 1; formula (14), p. 111 

5 Two methods of developing* this formula suggest themselves: (a) the 

elimination of the moment of a continuous graduating function expressed in 
terms of fine groupings of class intervals of on the one hand and in 

the terms of coarse groupings of unit class intervals on the other, (b) by a 
process similar to that of Sheppard, employing for example Lubbock’s form- 
ula instead of the Euler-Maclaurin sum formula. According to a statement 
made by Professor Carver, the formulae in question were derived by the 
latter process. 
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mation involved in employing Sheppard's corrections and correc- 
tions similar to them for discrete distributions. 

The problem we wish to consider is this Gnen the probabilities 
(pf;^) that a value of the statistical variable x taken at random 


v> ill fall within the interval Xi - 1 < x < x ± ~ , we wish to find 

the moments of the distribution We conside r this problem for two 
classes of disti ibutions * the distribution of a discrete variable ; the 
distribution hi a continuous variable. In either case we shall work 
with the uni-frequency distributional function f(x) . For the dis- 


crete distribution f ( ~ ) represents the probability that a value 


taken at random will he the number ~r ; rrt denotes a definite 


m 


of x 

positive integer, ( j * 0, 1, 2, • • F° r ^ ie continuous 

r b 

distribution / f (x)dx represents the probability that a value of ^ 
a 

taken at random will fall within the interval a<x<b Thus the 
function f (x) has the value zero outside the range of the distribu- 
tion and we may for the sake of convenience denote the limits of 
summation and integration as ± oo . For the n-th moment about 


the origin we have. 



for the discrete distribution , and 

f COd 

- qO 

for the continuous distribution. What we want is the value of 
What we are able to find is the value of 

= £ (V CPK>. 

L “-0 0 
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In establishing approximate relations between the set of true 
moments anc * the set of raw moments j'-v/J j we shall 

employ another set of statistical constants jv^ ■ . For the discrete 
distribution there are m distinct sets of groupings that can be made, 
leading to m values of the raw moment V p ; V n is used to repre- 
sent the average of these. Similarly for the continuous distribution, 

— . i i 

V n is used to denote the average of the moments v n correspond- 
ing to % 1 = l+t for all values of t satisfying 0 £ t < 1. We shall 

call this intermediate set of statistical constants |"V n the average 
grouped moments of the distribution. We then divide the problem 
into two parts. First we seek the expression of in terms of the 

\/ n | . Secondly we seek the nature of approximation in replacing 

by V n . The first of these can be solved completely without 
approximation and without any assumption other than the existence 
of the moments involved. We can best understand the nature o,f 
the approximation involved in the second after the first of our two 
problems has been solved. 

The rn values of V' corresponding to the m distinct methods 
of grouping a discrete distribution are given by 


■ +0 ° „ i n m-1 . i 

v n '(t).L a + t + ®*) z iidAEy.muox 

i=-o0 j*0 
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We shall first express the average grouped moment *1^ in terms of 
the true moments , and then solve for the in terms 

of the j\^ n | . We wish to arrange the right hand side of the above 

equation according to values of the argument X appearing in f (x) ; 
we therefore let .s^mi+k + j . This equation then becomes 



m-l-Zh n l r 
zm J m^rn' 


from which, by means of the binomial theorem, we obtain 




We therefore have 

(1) b i (m) ^n-i • 


where 



We shall sometimes write b t instead of b; (m) in order to simplify 
the expression of an equation The change of order of summation 
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ih based on the assumption that the m summations (t) converge 

absolutely, an assumption equivalent to that of the existence of )d' n 
since f (x,) has only positive 01 zero values. 

We see immediately fiom (2) that Cm) * 0, since the 

terms of the summation cancel each other in paiis, with the possible 

addition or a middle term equal to zero. The calculation of b^(m) 

may readily he effected hy means ot tne Euler-Maclaurin sum 
formula" 


m-i r m o-o 

£ gO+ {yj g(tjdt+S 

J=0 * J o 1 = 1 


Pai .. 

_4 i (20 1 


(Zi-i) 

g 



whei e 


3V 

1 , 

D z = 

i 


7 

15 ’ 

Dft- 

21 1 

*P\ 

127 

d 6 = 

15 ' 


We substitute 

gft)- Jtoi 


6 See for example Norlund’s Diffcren?enrechnung, Berlin 1924, espe- 
cially formulae (39), p 27; (42 1, p. 28, and (49), p. 30. Formula (39) is 

£J?)D n . L -i^-l)Vr)IWO for n>l,D 0 -l. 


From thk we ma\ obtain the \alues of D • and show that D,. ,=0 ; also 

Z c Z i. *+-1 

n £r»+l 

wc obtain £! ( ) D > =0 tor n >0 which we shall employ in the 

1=0 


proof of our formula (7). 
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obtaining 


(3) 




^ /£K+l -\ 

4 K f2k+l) m* 1 


The first few values are : 


b 0 (m) = l, 


b 2 (m)-' lz (i-^ z ), 




10 


), 


b, (m)« ]4 t 77 
* 1344 


21 49 31 


(3- ~j + 
rrr m 


nr 


). 




60 294 _ 620 


m 


381 


m c 


)■ 


A control on the values of b • (m) may be obtained by substituting 
m- 1 ; then all except b 0 vanish as b. ( 1 ) = 0 , for i > 0 . 

Having in (1) and (3) obtained the expression of the average 
grouped moment of a discrete distribution m terms of the true mo- 
ments, we wish to solve for the true moments in terms of the average 
grouped moments. We shall obtain this solution by the method of 
undetermined coefficients. Let 


(4) 


, I 


n 


- £ 
n j=o 


(i)A n ,Vi 


Substituting this in ( 1 ) we shall have 
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from which we obtain 

,i n 




by a change of order of summation effected by applying the Dirchlet 
sum 'formula 7 

n n-i. n n-j 

£ £ NV(i,j)-i: tl wflj). 

Uo j=o j=o i“0 


Equating coefficients of Vj gives us the recurring formula 
k 

E ( , ) h L A k L = 0 -for k v 0 , 

1=0 

together with the initial condition A 0 = 1* This may also be written 


( 5 ) 



k£ i; A 0 = i . 


Ordinarily in an expression such as ^(4) we would have written 
A n „j (r\) instead of A n _ j ; had we done so in this case, we would 
now drop the functional expression as we have shown that the value 
of A n _j (n) depending only on n-j is completely independent 
of n . The coefficients A n • are also independent of the position 
of the origin since if in 


M' n _ h = S (?)hV n _ L 


T For the method of derivation of this formula see, for example, Steffen- 
sen’s Interpolation (Baltimore, 1927), p. 91-92. 
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we substitute 


M 


r 


r\-i'X 


n-L 

j=o 


n4 

J 




r 


P6 J 


we shall have 


j 

' LL n.x.+ h 



and hence 


Mn:*+h = f? 0 ( j 1 A i v n-i-#+h 


If in (5) we substitute k * 1 we shall obtain A ( - b, 0 . 

Moreover in general A zX^l * 0, since by induction if 

^ sA 3 a,, ‘-' A Zi-l X °* 

the terms of the summation (5) will have respectively the zeio 
factors 

b 2i+i * A v b Zi- 1 ’ A 3'-- b y A zi-v b r 

Also from (5) we obtain: 

A z‘ ~ b * 

A 4 = * b 4 + b(b z ) 2 , 

A e = -b 6 + 30b z b 4 -90('b z ) 3 , ^ A 

A a *-b Q + 5e>b z b 6 + 70(b 4 ) z -iZ60(b z ) b 4 + 2520(b z ) . 
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An observation of these expressions of the As suggests the formula : 

y (-l) K (^0!k!(b z ) ai (b 4 ) Q4 ..-(b^) Q) 
A ZL*/L(Z)) a 1(41 ) a * • • (CZj)') ai (a^)(a z \ ). .. (ajt ) 


where U = +• a z , . aj , the summation extending over all 

positive integral 01 zero values of Oj , , dj satisfying 

a i -i- 2a z + *4 ja j s i . That this formula holds in general may be 
proved by induction: assume it true for l « 0, 1, *•* ]- 1 and 
substitute in (5) for U = 2 j . Upon collecting terms according to 
products of the b's we shall have established this formula also for 
i * j , and hence for every positive integral value. 

If in the expressions of the A s in terms of the b's we sub- 
stitute the values of the b’s in terms of m, we shall obtain the 
expression of the A’s in terms of m , Thus we have 


A Q = 1. 


A 


1 


). 


< 6) A -,40 


m- 
10 


(7- — * + 


Afc "154-4 


z 
4£ 
m 2 


A 

21 

m 4 


A»=- 


C 361- 


620 


m 

294 
m 4 




60 


)■ 


d 11520 ' m 2 m- m° m' 

A comparison of the values of A 0 with b D pof A z with , of A 4 
with b 4 ,of A^with b 6 , and of A e with shows a remarkable 
similarity between the coefficients in A 2l and those in ; in fact 

we observe that 


A z i = 


m 


Zl 


Substituting A for 


m ,in (3) and dividing by m 2 \ we obtain 
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(7) 


A Zk (m)- 


1 

A k (z k+i; 


£ fZK + l\ P 2k-2i 

i.x o \ 2 i + 1 / 


In order to prove that (7) is true in general, we assume it true up 
to a certain point and prove it true for the next highest value of k . 
That is we assume 


Azi ~ 


m 


zi b z t ( m) for L = 0 - 1 > 


and substitute in 


/A 


ZH 



b 2k-2L At 


another form of (5) since’ = ^ rom 0) we have 


and 


1 k ' L . , T) 

b = — y ( 2k-zi+i\j2j. 

ZH-2i (Zk-Zi + 1) Js0 ' z J ' ’ 


d 


'Z l 


4 l Ui + l) 


2 (Tr V 


21+14 


r=o 


m 


Z\-Zr 


After this substitution we arrange the terms of according to 
powers of ^ obtaining 



2 K+i 
2s+i 




1 

(2k-£$+l) 



k-25+1 

Zr 


> 


2r 


> 


where 5 - i - n j . When 5= 0, 1, k-1 the summation 

extends from r= 0 to r « k- s for ] ^ 0 , but from r - O to 
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r = k-s- i for j = 0 . Since 
k-s 


lC k T>^o. 


for 5 < k i the summation as to r gives zero for j ^ O but 

-(Zk-2s + l)D Zk _ 2s , for j = 0. 

At the same time for j = 0 , the factor equals unity 

and we have the desired terms for 5 = 0, 1, k- 1 . 

For 5 = k we have constantly p * 0 , the summation as to j be- 
ing from j = 1 to j * k ; we therefore have 




at the same time that 


- .Zk-Zs+ 1 N 

?_.( K" 1 - 


(2k-Zs + l) 2r 

We therefore come again to formula (7) with L replaced by s . 
Hence formula (7) is true for every positive integral value of k , 
The first few values of the As have been calculated m (6), others 
may be easily obtained by substituting the value of the Eulerian 
numbers from some table of D 2 ‘ t - 8 

Formulae (4) and (7) give us the expression of the true mo- 
ment 




of a discreet distribution in terms of the set of average grouped 
moments 


Norlund, loc. cit, Tafel 4, p 458, gives the value up to Tl^q 
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• r , 

1 j- 0 B "» u 2m/ M m/ 2m/ 


Employing the particular values given in (6), we have the formula 

Xl * ^n' (2 Ki'bl) ^n-Z 4o (4 )( 7 -&* &K-4 

^ 8 - t ~ 1344.^ + 7n^)^n-fe 


A / 620 £?4 QO 5 y 

*11520 ® m 2 ^ n-6 


For any particular value of n , this series terminates and we may 
therefore apply the ordinary theory of limits to (8), Thus we 
obtain 


( 9 ) 


Mn ~ *^n ~ ±z(z)^n-Z + Z40 



51 

1344- 


rj<- 


, m 

6 3040 



4*00 


the expression of the true moment * J x n *f of a 

- 00 

continuous distribution in terms of the set of average grouped mo- 
ments 

+ Q0 , +*p 


M 


J ( / <p c*+ 

- 00 


y )dx; =/ x 1 cp (x.) dx: . 
-00 


We have thus completely solved the first of our two problems ; we 
have obtained the expression of the true moments in terms of the 
average grouped moments without any assumption other than the 
existence of . The existence of requires the convergence 
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of the summation or integration as the lower limit approaches -00 
and as the upper limit approaches 4 00 independently. 

If m (8) we replace 




+ca 

■£ ■ 
j = -CO 


m i 1 * 

m V m 


m zm 


) 0(1^ \ 
' ^ m ^ m / 


by 


m -2: (*j) 

V j - - 00 J 


we will have the general Sheppaid-Carver formula. Since there is 
no approximation involved in (8), any error in the Sheppard-Car- 
ver formulae must be a result of the error involved in replacing 
the average grouped moments "V; by the raw moments 1 /; 1 . By 

_ I 1 u 

definition v* is the average of the 


- 1 4<kp L 

"Vj ft ) * C (j+'O (p Cj4t) , 

j=-oo 


and, therefoie, if we take any particular grouping at random, 

j), 

J— J 


is the mean of a random sample of one from the parent distribution 
V - 1 (t ) and hence the most probable value of “V^. The Shep- 
pard-Carver formula, therefore, gives the most probable value of 
the true moment ju’ n of a discrete distribution in the sense that 
these formulae eliminate the systematic errors due to grouping. 

Similarly, we shall obtain Sheppard’s corrections if in (9) we 
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replace 

+60 

=J x L <P(x)dv , 

-oo 

by 

I +< * > L 

v[ = £ (*•) (pM. 

These formulae give the most probable value of the true moments 
for a continuous distribution in the same sense as do the 
Sheppard-Carver formulae for a discrete distribution. 

The Sheppard corrections for continuous distributions and the 
Sheppard-Carver corrections for discrete distributions give the most 
probable value of the true moments >j.' n | of a distribution f (&) 
in the sense that they give an approximate value for ^ which is 
correct on the average. That is these formulae eliminate the sys- 
tematic errors due to grouping whatever the distributional function 
f (x) so long as the moments under consideration exist. While it is 
true that the accidental errors not accounted for in these corrections 
may not be negligible, these formulae do give the most probable value 
of ja ! n for a particular grouping and hence have a basis for uni- 
versal application. 



ON MULTIPLE AND PARTIAL CORRELATION 
COEFFICIENTS OF A CERTAIN SEOUENCE 
OF SUMS 

By 

Carl H. Fischer 

In a recent paper* the writer considered a sequence of q vari- 
ables defined as follows : The first variable, , is defined as the 
sum of n, values pf a variable, t , drawn at random from a pop- 
ulation characterized by a rather arbitrary continuous probability 
function, f(f) . Each succeeding variable, x> it (i >1), is defined 
as the sum of j? values of t drawn at random from the 
values composing , plus the sum of * values of t drawn 

at random from the parent population, 

For variables thus defined, it was proved that the correlation 
coefficient between any two consecutive sums, is 

independent of the probability function, f (t ) , and is given by 


( 1 ) 


r 
X, X 


: 1 , 1+1 


1 1+1 ( n i n l +i ) 


Vi 


It was further shown that the correlation coefficient between 
two variables not consecutive in the sequence is equal to the pro- 
duct of the respective coefficients of correlation between all inter- 
mediate pairs of consecutive variables. Thus, the coefficient of 
correlation between x j and is 


or, in a simpler notation, 


( 2 ) 


r jp = r j, j+i* r j+i,i+i ' ' ' ' r p-2, p-i‘ r P'i, p • 


* On Correlation Surfaces of Sums with a Certain Number of Rapdom 
Elements in Common, Annals of Mathematical Statistics, Vol. IV, pp, 103- 
126, May, 193 3 
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Let us now determine the multiple and partial correlation co- 
efficients existing among a sequence of variables thus defined. 

Consider the fundamental symmetric determinant, P , which, 
with its various co-factors, appears m the standard formulas for 
multiple and partial correlation coefficients. 1 * If we substitute for 


each r l]t (}£ 


1), in 1? 

, its equivalent from equation 

(2), we have 








i 


r u 

r r 

1Z 23 

r 0 

12 13 

hM 


r u 


1 

*23 


*23 S 4 

Hq 


1z I 3 


r Z 3 

1 ' 


4 ^4 

Hq 

(3)1?= 

r p r 
tt *3 34 

r r 

r 34 





r r * 
r iz r Z3 

• r q-z,q-l 

r * * r 
'll q- 

J,q-1 *34 


■1 ’ 

Hq 


r r 

•Hq 

p . . r 
r Z3 q- 

•i,q l 34' 

3rVi 


1 


Multiply the second row of P by and subtract it from the first 
row. Now multiply the third row by r^ 3 and subtract it from the 
second row Continue this process, multiplying the j-th row by 
r, i i and subtracting this row from the (j-l)strow, until all 
possible rows have been so treated. Equation (3) may now be 
written 

i H. L. Rietz, “Mathematical Statistics”, Carus Monograph No. 4, pp. 
94-100. 



280 


ON MVUfPLE AND PARTI A l COEFFICIENTS 



0 

0 

0 

0 

r^Cl-qp 

(K'A 

0 

0 

0 

P » r »( 1 - r 34 ) 

r (i-r z ) 

■ 

0 

0 



r M (l-r* 5 ) 

0 

0 

•ufo - ^-2 ( q-i Aq l,q) 

r ia EfZ,q4 

1+ r q 2,q-l( i " r q-i..q.' 



r Z3 $-1,4 

r 34* 

* 


The expansion of 13 may now be readily accomplished. The" 
application of this same method of procedure to each of the various. 

, (where T3 t j is the co-factor of the element ), yields 
without difficulty the results made use of in the remainder of this 
paper. 

A. Multiple Correlation Coefficients. 

The formula for the multiple correlation coefficient of one 
variable on the remaining q-1 variables is 

/1-T3 

(5) 1 + 1 , • 4 

From equations (3) and (4) we derive the following expressions 
for the necessary 12 and 1?^ , (j = i, 2.,3, • • • q). 
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( 6 ) 




V ^Xl-sfXl-s/l -fl-rVwXl-r*!,,); 


’V a ' r u r j XlTaJ Xi-r4 )• • • ) ; 

^33*^'^ ^' r A r 34^ 1 ‘ , i^ 1 ‘ ’ (l' r q-2 1 q.lX*' l q i -l 1 q)i 


’’ (^~ ^q-^q-i r q-lj q ) » 


^q-i,q-l“ (^~ t iz^' r z3 )^ _[ cf-2 1 q-l r q4,q)' 

■ • ‘ •(■ l ‘ r £s,q-z)( L - r q-2,q-l)- 

Upon substituting the proper values from (6) in formula (5) 
for the multiple correlation coefficients of the first and last varia- 
bles, respectively, on the others in the sequence, we find 


(7) 

*i- 234 

II 

to 

(8) 

r q- 123- 

•q-l= r q-l,q 
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The multiple correlation coefficient of any other variable, x,j , on 
the remaining q-I variables is given by 


J* 1Z3 ’ q 


( 9 ) 


z 






It is to be noted that the right member of (9) is independent 
of all of the simple correlation coefficients except rj^ ^ and - 


B . Partial Correlation Coefficients . 

The formula for the partial correlation coefficient between 
any two variables is 

(10) r iJ.lZ34- * pjT 

From equation (3) we derive the following expressions for the 
co-factors of elements other than those of the principal diagonal 
of the determinant. Because of the symmetry of the fundamental 
determinant, we know that -T2^ ; hence in expanding the 

co-factors of the elements of each row, we shall consider only the 
T2- where i i j . 

1. The co-factors of the elements of the first row are 
^ “^12 = r iz 0- _t Z3 '(i' r q-2 ) q-i)fi' r q 2 * * '-l ] q); 


2. The co-factors of the elements of the second row aie 

(12) '^23 = ^' r i 2 Fzi )■ 6-’ r q-z,q l)0-" r q-l,q )» 

^i l0 . (i- 4,5,6,..- .q). 
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3. The co-factors of the elements of the j-th row are 

(U) ~”®j, j+l ' C 1_r i2 )6' l 23)" , ^' l J-l,j)!j,i+l(^' l j+l 1 ]+2) 

^ji. =°. ( U i + 2 ’ l j+ 3 , q) 

4. The co-factor of the last element of the q-f h row is 

(!4) ""Rcj-i>q s (^~ r i z)(^ mjr zsr) ' * 

We see at once that all partial correlation coefficients between 
non-consecutive variables vanish, as each co-factor 12 = 0 if 

L^j-1 ,j , j+1 . The non-vanishing coefficients, those be- 
tween consecutive variables, are given below. 

X 

(IS) r = r I JLlAl 

K ’ l2- 345- q r XZ 


(16) r jj+i*i234 q s 


(Hf r zl \ 


r JJ*iXi-<3f i+1 r^ ljl+2 )J 

4 


O 7 ) r q-l,Q-i234-”-q--2 = r q'l,q 




(i-fq-^q-l P q-l,q ) 


From (16) we can state that in general the partial correlation 
coefficient of consecutive sums Xj andxj.^ is independent of all 
simple correlation coefficients except j , r* J+: p and * 
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C. Summary 

To summarize, we have shown that 

1 The multiple correlation coefficient of a variable # • 
on the remaining variables ot our sequence is inde- 
pendent of all simple correlation coefficients except 
those between ^ and the immediately preceding and 
the immediately following variables, respectively 

2, The partial correlation coefficients between all pairs 
of non-consecutive variables in our sequence are zero ; 
a result that appeals to the intuition when it is recalled 
that we are eliminating the effect of the variables that 
form the connecting links between the two under con- 
sideration. 

3. The partial correlation coefficient of any pair of con- 
secutive variables, and #j +1 , is independent of all 
simple correlation coefficients except those between 
the two consecutne variables in question, between the 
first of these and the variable immediately preceding 
it, and between the second of the pail and the variable 
immediately following it. 


CjuX H . 



AN EXPERIMENT REGARDING THE xl TEST 

By 

Selby Robinson* 

L Introduction 

R. A. Fisher has proposed that in case the hypothesis being 
tested has been partially obtained from the data, the Elderton table 1 
for % should be entered with n 1 equal to, not the number of fre- 
quency classes, but the number of frequency classes minus the 
number of statistics computed from the data. It has been proved 
under certain restrictions that this theory holds in the limit as the 
size of the sample approaches infinity. 2 

For samples of moderate size our only guide is experimental 
evidence, which indicates that Fisher's method is satisfactory in 
practice. 3 This is true in particular of the evidence presented m 
the present paper which describes a coin tossing experiment sug- 
gested by Professor H. L Rietz. 

2. The Experiment 

The experimental work here considered was done by seventy 
students each of whom tossed seven coins 128 times.* In any 
one of the seventy experiments, this results in a frequency distri- 
bution of 128 items divided into eight frequency classes But we 
lumped together the clashes of zero heads and one head and like- 
wise for six heads and seven heads, so that we had six frequency 
classes. If for every coin on every throw the probability of heads 

* National Research Fellow. 

1 Karl Pearson, Tables for statisticians and biometricians, (1914), 
Table XII. 

2 J. Neyman and E. S, Pearson, Biometrika. V. 20A (1928), pp. 263- 
294 

a Yule, Journal of the Royal Statistical Society, V. 85, pp. 95-104; 
Brownlee, ibid V 87, p 76 ; Neynam and Pearson, Loc. Cit, ; Sheppard, 
Phil, Trans., A. V. t 228, p. 115. 
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is one-half, the expected numbers in the six frequency classes 
are 8, 21, 35, 35, 21, 8. The divergence of the actual distribution 
n i ’ n l * • * n 6 f ^ rom theoretical one is measured by 



If we had possessed only one sample of 128 throws, we would have 
looked in Elderton’s table with the value of X and with rV « 6 
(the number of frequency classes) to find the probability P^ that 
a sample would by chance deviate from the expected distribution 
so much as the actual sample had deviated. But having seventy 
samples, we compared the distribution of our seventy values of X 
with that expected from n - 6 . The arithmetic mean of our 
seventy values of was 4.62 whereas the expected value was 
five ; a deviation which could very well occur by chance. So our 
results are consistent with the hypothesis that the probability of a 
head is always one-half. 

We considered next the following composite hypothesis : the 
probability p of heads is the same for all coins on all throws. For 
any sample of 128 throws, we took as the estimate of p the actual 
proportion of heads among the 7 x 128 possibilities. From this 
value we calculated the expected frequencies, r n , . . . , m^. For 
each of the seventy samples, we computed 

1-6 9 

x z = 22 — - mi 

1 Ui 

When we used the test to compare this distribution of X,% 
with that expected for n* - 6 , we found that : 01- But 

when we compared our distribution with that expected for 
rV -6 ~ i =3 we found that P^ = .S£, The mean value of our val- 
ues of Xj was 3,97 compared with 6-1-1= 4demanded by Fish- 
er's theory, and with five by the theory that is distributed aa 
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yJ 4 , If the latter theory were correct the probability of the mean 
of seventy values of X ( being so far away from five, is 007. 
That our distribution of corresponds to that expected for 
rV = 5 whereas our distribution of the values of corresponds 
to n 1 = 6 , can be seen from the following table. 4 


Values of X ^ 
(or Xf). 

Observed 

frequency 

ofX* 

- 

Expected 

frequency 

n'*6 

Observed 
frequency 
of X, 

Expected 

■frequency 

n=5 

0-1 

3 

26 

7 

6 3 

1-2 

8 


12 

12.2 

2-3 

11 

■H 

14 

12,5 

3-4 

14 

10 5 

12 

10.6 

4— S 

12 

9.3 

10 

8.3 

5-7 

11 

13 7 

7 

10.6 

7-9 

5 

7.8 

3 

5,2 

greater 
than 9 

6 

7,6 

5 

4.3 


4 In computing we combined the first two classes ot this table 
and also the last two, thus making n' * 6 . 






ON SAMPLING FROM COMPOUND POPULATIONS* 

By 

GEORGE MIDDLETON BROWN 
Introduction, 

The decided asymmetry or the multimodality of certain fre- 
quency distributions may have prompted the idea of the possibility 
of the existence of frequency curves, apparently single in charac- 
ter, but which, on further investigation, might be shown to be 
actually composite. In other words, apparently homogeneous ma- 
terial may prove to be heterogeneous, or divisible into two or more 
distinct homogeneous groups 

The above ideas lead naturally to the problem of dissecting a 
compound frequency function into its various components. Karl 
Pearson 1 successfully solved such a problem, using the method of 
moments, on the assumption that the compound parent population 
was composed of two normal components Each component curve 
has three parameters, the mean (or position of axis), the standard 
deviation, and the area (or total frequency). One requires there- 
fore, six relations between the parameters of the gnen compound 
frequency curve, and those of its two components, in order to deter- 
mine six unknowns. The ultimate solution of the problem turns on 
the determination of the zeros of a nonic equation, the location of 
whose real roots is obtained, to successive approximations, by 
means of the so-called Sturm’s functions. 

The dissection problem was taken up later, first in a paper by 
Charlier/ then in a joint paper by Charlier and Wicksell 3 who 


* A dissertation submitted in partial fulfillment of the requirements fox* 
the degree of Doctor of Science in the University of Michigan. June, 1933 

1 On the dissection ot frequency curves into normal curves Karl Pear- 
son. Phil. Trans. Roy, Soc. Lond Vol 185, Pt. i, pp 71-110. 1894A. 

- Researches into the theory of probability. C. V. L. Charlier. Medtlel- 
anden frau Lunds Astron Observ. Sec 2, Bd 1 1906 

3 On the dissection of frequency functions. C, V. L. Charlier, and S D 
Wicksell, Arkiv. fur Matematik, Astron. och Fysik. (Medde)au<te') Band 
18. No 6 1923. 
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considerably simplified the theory, finally ai riving, however, at the 
fundamental nonic due to Pea i son, for the solution of which, they 
suggested the use of a graphical method. They also studied special 
cases of the more general problem, e g the means of the two com- 
ponents assumed known, the compound curve assumed symmetri- 
cal, or the standard deviations of the two components supposed 
equal. In addition, they extended the problem to the case of fre- 
quency functions of two variates. 

In the present paper, I propose to investigate the sampling 
problem in the case of compound distribution functions, and trom 
a consideration of the dissection pioblem, one is led to a division 
of the present investigation into two mam paits, for the following 
reasons. 

On the one hand, in sampling from a compound population, 
if we do not know the proportion contributed to the total frequency 
of the sample by each of the two components of the parent popula- 
tion, we are essentially sampling from a single population. That is, 
random samples of N are drawn from a single composite parent 
population made up of two components Hence, the previously 
obtained results for sampling from a single parent population will 
be available if we derive expressions for the parameters of the 
compound parent in terms of the parameters of its components. 
This is done in Part 1. 

On the other hand, however, if we know the proportion con- 
tributed to the total frequency of a sample by each of the two 
components, the situation differs entirely from that studied in Part 
1, Here we are concerned with sampling from two distinct parent 
populations, and tn Part 2, I develop a method for dealing with 
this problem. Thus, in Part 2, it is assumed that samples of p 
and s respectively are drawn from two distinct parent popula- 
tions, and these two samples are then combined to yield a sample 
of r+ 3 s N from the combined populations 
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Therefore in Part 1, we are essentially sampling from a single 
parent population, whereas in Part 2, we are sampling from mul- 
tiple populations. 

The developments of Part 2 yield some new sampling results 
for sampling from two parent populations. In Section 6, I derive 
expressions for the semi-invariants “of moments about a fixed 
point” in samples from the compound frequem function, in terms 
of the corresponding semi-invariants of the moments of its compo- 
nents. In Section 7, expressions are derived for the semi-invariants 
of “moments in samples from the compound population about the 
mean of the combined sample,” in terms of r and 5, and the semi- 
invariants of the two components themselves. 

The occurrence of a certain class of well-known polynomials 
in the development of Section 1, is of especial interest, since these 
are, except perhaps for sign, the semi-invariants of the binomial 
distribution, and have some rather important properties, and their 
further study, although not pertinent to the problem in hand, should 
yield some very interesting results. 

Section S is devoted to the discussion of the case in which a 
limiting compound frequency function exists, under certain as- 
sumptions regarding the nature of its components, where the num- 
ber of the latter is allowed to increase indefinitely. This idea of a 
limit frequency function would appear to indicate the possibility 
of a new approach to the theory of frequency curves, in which the 
variable may now be a complete frequency distribution in itself. 

This investigation was begun on the suggestion of Professor 
C, C, Craig, of the University of Michigan, U. S. A. to whom I am 
indebted for constant inspiration and guidance during its pursuit. 
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PART 1 


Section 1 . The semi-invanonts of the compound frequency 
function } in terms of the scnn-mvct riant s of its two normal com- 
ponents . 


The main object of this first section, is to obtain expressions 
for the parameters of a compound population m terms of the para- 
meters of its two normal components, and to this end, I shall use 
the following definition of the semi-invariants of Thiele. 1 


t 3 


f(x) 


xt 

e d>£ . 


I write therefore 


i(x,)=p (P ( (x0+ q <P 2 (x). 


in which f (xi) is the compound frequency function, (f) (x), ($,&j)are 
its two normal components, and 1 


If L it L z , etc. are the semi-invariants of f(x), then 




(l)e 


Z * 2 ' 




where rq, rq, C , <J , are the means and standard deviations of 
^(Aj^^fxOrespectnely. For convenience, I write 

m z -m^a. <X,- a ± =b - p * r 

We wish to express the L m (1) m terms of the quantities nq, 
™ v <{> O z , p .°r q . 

Taking logarithms in ( 1 ) , 

1 Numerous references relating to the theory of scmi-nivai lants mav lie 
found at the end of “An application of Thiele’s semi-invariants to the sam- 
pling problem", C. C, Craig. Metron. Vol, 7, No 4, 1928, p. 73 



292 ON SAMPLING FROM COMPOUND POPULATIONS 


t Z t 3 

'-i t + L Z l\ + 3! + 


( 2 ) 


= foq p + + |-| )+ foq (l+re 2! ) 


We require now a suitable form for the expansion of the third 
term of the right member of (2) in successive powers of t . We 
have 

( 

■fog v 1+ re /= foq (1+ r)+ foq 



Further 



The complete representation of terms of the type 



in the right member of (3) will be 
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Therefore, the coefficient of 


yp in the tight member of (4) xu 


n i -n , n~i * i 
n! a J • b J * t J 

j = pji]. ^ n ‘ J - (2j-n)'.(n-j)i 


wheie l^— 1 
integer m 


means the largest 

n* 1 
£ ’ 


t 

Then, the coefficient of — in the right member of (3) is 

n k M ) i * l -q k {k\ n fa n-j L i 

(5) L = E C ; f Z : — - — — ;n>Z 

n k-iui k Z n J (2 ] -n)i (n-)) 1 


and this is the relation sought t jn which the sevni -invariants of the 
compound frequency function me expressed in terms of the semi- 
invariants of its two normal components Below, I have written 
out m detail the expressions for to inclusive, 

L t = m 1+ -aq. 

= + b ] 

Lj= a 3 pqq 3 + 3abpq . 

(6) b 4 “ a 4 pqq 4 +6a :1 bpqq 3 + 3b z pq. 

L 5 = a' S pqq 5 +-10d 3 bpqq 4 +-15ab z pqq 3 . 

L fc = a fa pqq b + 15a 4 'bpqq 5 +45a 1 b z pqq 4 -t-i5b 3 pqq 3 

l 7 = a 7 pq<q 7 +21a s bpqq 6 + 105a 3 ti l pqq 5 + .105ab a pqq^. 
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L g =a 8 pqq s +Z8a e ’bpqq 7 -*-El0a 4 b Z pqq 6 +4Z0a z b i pqq^l05ljpqq^ 
L 9 «a 9 pqq 9 +36a 7 bpqq fl +37fla 5 bVqq 7 +1^60a 3 b 3p qq 6 +945 Q t)'pqq 5 . 
in which 

q 3 -l-2q. 

q^“l-6q + 6q z . 

q s = 1- 14q + 36q 2 - 24q 3 . 

(7) 

q fe « i- 30q+ 150q z - 240q 3 + lZOq\ 

q 7 = l-6Zq+540q z - 1560q 3 + 1600q 4 - 7Z0q 5 . 

q 8 -l-lZ6q+i0O6q z -54OOq 3 +166OOq 4 -1512Oq' 5 +5O4Oq fe . 

q 9 « i - 254q +5796q*'-40824q a 4l2b000q 4 - 1915 20q 5 
+ l411Z0q fe -4 03Z0q 7 . 

The expressions for theL n in (6) have two properties, which 
enable one to write them down readily. In the first place, assuming 

that the polynomials in q(orp*l-q) are suppressed, i.e. q' * q - 

pq, q'^pqq^, q^pqq^, etc., are all set equal to unity, then the 

resulting functions in “a” and “b” are readily obtained by means of 
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a well-known recursion formula. Secondly, considering the poly- 
nomials cj,' K as coefficients in the several terms of the original com- 
plete expressions for the l_ n in (6), for rr>2 , and arranging 
these expressions so that their corresponding terms appear in col- 
umns, the first terms in the first column, the second terms in the 
second column, and so on, then every term in any diagonal array 
proceeding from upper left to lower right, and consisting of one 
and only one term from each of the expressions (6), will have the 
same polynomial coefficient. 

I proceed now to obtain expressions for the L n in (6), in 
which the individuality of the polynomials , q’ , , etc,, has 

been suppressed. This time I write 


( 8 ) 


foq(l 


aU% , 

+ re J - tocj 




The term in 


t'in 


t k / bt \k . 

wA a 'z) 15 


(2k-s)\(s-k)\Z s ~ h 

Rearranging the series |n brace of right hand member of (8) 
in successive powers of t , 


/ . f - f a zk-s b 5 - k .t s __ 1 

toqlltre * /* foj |1 + r .. 2 s '^k-s)l( S -k)l 


( 9 ) 


=. -foq(i+-r)-t- foq (l + @. 


, e/ 



2% ON S/IMPJ/Nr; /* KoAi COMF'n ND POPVLAI IONS 


where 0 L - 1*^3,* ♦ . 

4-S 

Therefore, the coefficient of in the series in the bract «n ihe 
right member of (9) is 0 5 , where 


( 10 ) 


s 



a!Q 2k-s b S-k 
Z 6 '* (2k-s)!Cs-k)l 


From equation (2) and (9) above, we have 


t ** t* 

L l t+ L Z Il 4 ' L 3ll 4 " 


( 11 ) 


» ioq p+- foqti+nl+^t )+ foq(l + e t t + ^ + 
Therefore, equation (11) becomes 


(12) L i t+L )J |i + ‘- - (m^t + ^j) 4- f oq (l +• Q^t +• 9 z '^i + •'')■ 


One might note, in passing, that in (12), the 0 s are playing the 
role of moments, if one recalls the definition of semi-invariants, 
so that it would be possible to write down a second general expres- 
sion for the L 5, using the well-known foimula for semi-invariants 
in terms of moments. 

The first six 0s take the following form 

Q 1 = qa 

0 Z = q(a z +b) 

0 3 = q.(a 3 +3ab). 

0 4 = q (a\ 6a* b + 3 b z ) 

% “ q(a- 5 +10a 3 b+-15 ab z ) 

0^ " a (a. 6 + 15a 4 b + A F in z h s, i 1 ii 3 ) 
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If, in the last set of relations, we set q« 1 , we then have 0 a = /S,, 
and if in the expressions (6), q\ , etc., be all set equal 

to unity, we shall get, for n > Z , 

L n - <a„ ■ 

I shall now show that the /2>5 follow the recursion law 


(13) 



/3 5 . 


Now, putting a tt ^Ls(p(t), we have 


f t e‘ W) -(a + M).e' PW 


and in general 

( 14 ) a+bt).e <f(t) 

dt 

where (x; f y) is a polynomial of degree * in ^ and y . 
It is easily shown that 


(15) e 


m d 


'dt 


P (b.a+bt) 


r fefb.a+btj-e^ 


t*o 


da i 


t*0 



298 ON SAMPLING FROM COMPOUND POPULATIONS 


and that 

d 

( 16 ) V ^-2 


a.P 5 - e 


m 


t=0 


t=o 


Now deriving the left member of (14) with respect to i , and then 
setting t =0 , gives the nextyS , namely yS 5 ^ > by definition, whilst 

the derivative of the right member of (14), and setting i-O, 
would equal the sum of the right members of (15) and (16), which 
establishes the proof. 

The second property of the expressions for the L h in (6), 
which requires proof, may be stated as a theorem thus — “The k-th 
polynomial coefficient in the expression for the semi-invariant 

L. , m>l> is identically equal to the l^t polynomial co- 

efficient in the expression for the semi-invariant 

For simplicity, 1 have considered the first and second poly- 
nomial coefficients of and respectively, the proof going 

through in exactly the same manner if perfectly general terms in 
these expressions were considered. 

From (5), suppose that n^2m (even). Then 


(mi 


2m 


*7 £ <j(k\ g 1 Ml a yZm b* m ~ J • t j 

£l£l k U JjZm 


The leading term in L Zm , i.e. the first term in /3 Zm , multiplied 
by a polynomial in q , is obtained from (17), by setting j =» Zm • 

k-1 i-i K \ » / 
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Therefore the polynomial coefficient of a lrn i.e. the leading co- 
efficient in L 0 is 


(18) 


k 


2 Z 

k=l l=1 


, n i+i >< 

(j - 1 • q 



2m 


Again, from (5), when n = 2m + i (odd), 


L 


‘£rml 


£rmi k 

-2 2 

k«i i*i 


W%Yl . 

k \lr/jann+l 2 am + l)-i . ( 2 j +1 - ] ] I 


The second term in obtained by setting j * Zrr \ . It is 

2m+i k M) lH . n k / Lf A , , , 

Z £ - - V — • ( k ) • mfcm+l). a 21 "' 1 - b • L . 

K*1 i=i 8 \ L / 


2m-i 


Therefore, the polynomial coefficient of a " • b i.e. the second 
coefficient in L^ m+ ^ is 


(18') 


2 £ +i £ f-l) iH -q k / 8 ^ 2m 

k«l Ui 8 \ i J 


Comparing (18) and (18'), which must be identically equal, if our 
theorem is true, it remains to show that 

2m+i 2m+l i+L 2m 

£ C -i .[ . ) i « 0. 

Zm+1 L»1 1 ; V t / 

but it is well known that this expression is identically zero - 


2 See Hall and Knight. Higher Algebra, p 259, Ex. 2. 
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Section 2 . A table of values of a certain class of poly- 
nomials in one variable , for different values of the argument . 

In order to facilitate the actual computation of the values for 
the semi-invariants L n , given in Section 1, in a particular appli- 
cation of the theory, when a* , and b- O* , are 

known, I consider the expressions for the L n , as they appear in 
the form indicated in (6). Now, when b *c^-c J^O, i,e. the two 
components have identical standard deviations, the set of relations 
(6) take the form 


(19) l^-iT^+aq. \l z *af + a Z pq 'and L? n - a n pqq n ; n>Z 


in which a . q Ai etc., have the same significance as in (7), Mak- 
ing use of the properties of the expressions (6), which were stated 
at the end of Section 1,'trom (19) we may write the for 
n > 4 , as follows 

= X + ^(a)^4 + ^(a) 


b\ 2 ^ 


X ", 

X 


u. L 


15 (I) ^ *5 (h/l s • 

\ X 2 \ 

q X(f )q.ios(gq+io5(|)q . 


6“ h> + 


(20) 


X 

v \ 


. I 


La~ ho +• 


zsH) q * 210(3) ‘VXX - Xa) 

X \ V 


a - 8 

N 


. 4 - 

u 


X .,_,z 


. I 


3=1 +■ 
and so on. 


36'g)L' 8t 37s(|)q + lZ6o(X t Xa) 


Therefore, for n > 4 , the general semi-invariants L p of (6) 
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may be expressed in terms of the special semi-invariants L* n , 
obtained from the former by setting b = 0 . From (20) in general 
we have 


n k = o 


e 


n~k 


(2k) 
n 

2 k -kl 



in which 


n-k 

n-k ~ a n-k > 


because 


L. 





n-k 

a PWn-k 



k-0 


n-Zk . k 
a • b 


n^i( b\ K 

' Z k - k! (a/ 
Z k . ki 4 n -k > 


and the last expression, for n >4, is the equivalent of the general 
expression (5) for the L^. 

Further, if we consider the terms in the expressions (20) as 
elements in a set of diagonal arrays, as I have indicated, it is evi- 
dent that, moving downwards along any particular diagonal, any 
term in this diagonal is obtained *rom the one immediately pre- 
ceding it, by the use of a multiplier The formula for the 

calculation of the , may be derived as follows. 

Consider any term of say L n > whose numerical coefficient is C^ n . 
Cet this be the(k-+-l)5t term. 

Then 

C k+l,n ‘ z k -k\ 
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Similarly, take the (k+2)ndterm of L mi , with numerical coeffi- 
cient C . . Then 
k+^n+1 

C 

k+Z,n+l 2 k+i - (k+i)! 

and we note that C , and C, „ .are the numerical co- 
k+i, n k+Zjn+l 

efficients of two adjacent terms in one of the diagonal arrays men- 
tioned above. Therefore 




(n+l) 


(Zk+2) 




C 

Z i n +1 


Cn+l) Cn- 2k) 

z( k+O 


c 

k+l ; n 


n ZZk 


and 


M 


k+l,n 


(rH-i)(n-Zk.) 

Z(k+l) 


It is of considerable interest to note, that the \l n of (19) (for 
o>Z ), are, except perhaps for sign, the product of the semi-in- 
variants A n (n>i) of the binomial distribution and appropriate 
powers of “a”. To show this, we need only consider the generating 


at 


function for the L!.^ , namely l4*q(e -l), and that for the A n , viz 
[ l + p(e -l)] » with 6-1 « Frisch 1 has obtained a recursion 

d , 


formula for the A n , which is 




dp 


n-1 


n>l 


1 Sur les semi-invariants et moments employes dans l’etudc des distri- 
butions statistiques. Ragnar Frisch. Skrifter utgitt av Det Norske Viden 
skaps-Akademi i Oslo. 1926. No. 3, Ch. 2, p. 29 
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Therefore, it is evident that the L n obey a corresponding iecur~ 
sion formula 

,* n d \ 

A* a pC L' dq L n-l n>2j 

In fact, the polynomials q^=pq, q‘ 3 = pqq , q[=pqq > etc., are the 
same functions of q as the A n are functions of p, i e. 


V VA 


Hn^nCq)- 


for n> Z. 


To investigate thoroughly the properties of the polynomials* 
q 1 , etc , would be irrelevant to the problem in hand, but, 
so far as I know, such a study has not been earned out, I will, 
however, mention a few of these properties, which appear inter- 
esting. 

1. The roots of the polynomial q* (for any n 'LZ ) are all real 
and distinct, and these roots all he in the interval (0, 1), zero 
and unity being roots of every polynomial, 

2. The roots of q' n separate the roots of q ; ^ 

3. The polynomials q 1 ^ , of even degree in q , are symmetrical 

with respect to the line , whilst those of odd degree in q , 

namely a 1 J , are symmetrical with respect to the pomt 

, i5n+l 

(/a, 0) 

4. An orthogonality property in (0, 1) holds if and m + n 


is odd. That is 
ri 


but 


q q 1 ,0. 


O q ^O. 

L m *-n 


m^n, m+n(odd) 
m^n, m+n(even) 


$ These same polynomials appear as functions of p in a paper by H 
C. Carver on "Fundamentals of the theory of Sampling" Amcr Statist 
A*>soc. Annals of Math. Statistics. Vol X, No. 1, Feb, 1930, p. 106. 
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5. Further 


n-1. 




h tr\ 
o 

in which B_ is the Bernoulli number of order 2n . 

In ^ 

0 

In calculating the actual values of the L n expressed in the 
form (20), it would obviously be very convenient to have at one’s 

disposal a table of values of the polynomials q l *p<q 

q 1 = pqq^, etc., for a range of values of q, since the latter, when 

multiplied by appropriate powers of “a”, are the ii^ of (19) and 

(20). I have, therefore, set up such tables, for values of the varia- 
ble q ranging from 01 to 10 inclusive, at intervals of .01 , 

It is to be observed that only functional values are recorded 

here for . 01 4 q£ , 50 , since we would merely repeat these values 

when 50^ q £ 1 0 > in the case of the polynomials a , of even 

degree, whilst in the case of those of odd degree, namely ^ > 
there would merely be a change of sign. For it is easily seen that, 
writing, q' n = A n (q) , 

\(<})“ (-OH ^ n (ph Cp«l)-i. 


Hence 


and 


^n (c lh\ n (P> 


^n+iW-’^Zn+l^* 


I have calculated the exact functional values of all the poly- 
nomials q 1 ^ , for and these values appear in the tables 
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for those cases in which n g. 4, but for n> 4 the functional 
values are written down coriect only to eight decimal places I\ach 
polynomial is set out in detail below . 

H' z =c i- c i Z: 
q ‘ 3 *<\-H z+Zc i 3 - 

q' 4 -q-7q*+Uq 3 -6q*. 

q^ = q-t5q*+50q 3 -60q 4 +24q 5 . 

q' - q-31 cf+ 180q 3 -390q 4 -+360q 5 -170q fo . 

q* . q- 63q z + &02q 3 -2l00q 4 +336Oq 5 -Z5ZOq b + 720q 7 

q» » q-127q Z +1932q 3 -10£06q 4 + Z5200q 5 - 3l920q 6 

+ 20l60q r -5040q e . 

q = q- £55q z + 6050q 3 - 466Z0q 4 +166«324q 5 

- 3175 £0 q 6 + 332640q 7 - 10144Oq fl + 403?0q 9 
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<1 


4s 

4a. 

<5 

,01 

0099 

.0097 02 

.0093 1194 

0085 4940 

.02 

0196 

.0188 16 

.0172 9504 

,0143 9048 

.03 

.0291 

0273 54 

0240 1914 

.0178 0198 

.04 

.0384 

0353 28 

.0295 5264 

.0190 4886 

,05 

.0475 

0427 50 

0339 6250 

.0183 8250 

.06 

.0564 

0496 32 

0373 1424 

0160 4106 

.07 

.0651 

.0559 86 

0396 7194 

.0122 4974 

,08 

.0736 

0618 24 

0410 9824 

.0072 2104 

.00 

.0819 

0671 58 

.0416 5434 

.0011 5512 

.10 

.0900 

.0720 00 

0414 0000 

- .0057 6000 

.11 

.0979 

.0763 62 

0403 9354 

- 0133 4808 

,12 

1056 

.0802 56 

0386 9184 

- 0214 4440 

.13 

1131 

0836 94 

0363 5034 

-.0298 9550 

,14 

.1204 

0866 88 

.0334 2304 

-.0385 5882 

.15 

.1275 

0892 50 

0299 6250 

— .0473 0250 

.16 

1344 

.0913 92 

0261) 1984 

- .0560 0502 

.17 

1411 

.0931 26 

.0216 4474 

-.0645 5494 

.18 

.1476 

0944 64 

0168 8544 

-.0728 5064 

.19 

.1539 

0954 18 

0117 8874 

-.0807 9996 

HiKI 

.1600 

.0960 00 

.0064 0000 

-.0883 2000 

.21 

.1659 

.0962 22 

0007 6314 

- .0953 3676 

.22 

1716 

.0960 96 

-.0050 7936 

-.1017 8488 

,23 

.1771 

.0956 34 

-.0110 8646 

-.1076 0738 

24 

1824- 

.0948 48 

-.0172 1856 

-.1127 5530 

.25 

,1875 

0937 50 

-.0234 3750 

-.1171 8750 

.26 

1924 

0923 52 

-.0297 0656 

— .1208 7030 

.27 

,1971 

.0906 66 

-.0359 9046 

— .1237 7722 


2016 

,0887 04 

-.0422 5536 

-.1258 8872 

,29 

.2059 

0864 78 

-.0484 6886 

-.1271 9184 


.2100 

.0840 00 

-.0546 0000 

-.1276 8000 

« 31 

2139 

0812 82 

- 0606 1926 

-.1273 5264 

.32 

.2176 

,0783 36 

- 0664 9856 

- 1262 1496 

.33 

,2211 

.0751 74 

- 0722 1126 

-.1242 7766 

.34 

.2244 

,0718 08 

-.0777 3216 

- 1215 5658 

.35 

2275 

.0682 50 

-.0830 3750 

- 1180 7250 

.36 

2304 

0645 12 

-.0881 0496 

-.1138 5078 

,37 

2331 

.0606 06 

-.0929 1366 

-.1089 2110 

38 

.2356 

.0565 44 

- .0974 4416 

— .1033 1720 

,39 

,2379 

0523 38 

- 1016 7846 

-.0970 7652 

.40 

.2400 

0480 00 

-.1056 0000 

- .0902 4000 

.41 

.2419 

.0435 42 

- 1091 9366 

-.0828 5172 

,42 

.2436 

0389 76 

-.1124 4576 

- .0749 5864 

.43 

,2451 

0343 14 

- 1153 4406 

-.0666 1034 

,44 

.2464 

0295 68 

- 1178 7776 

- .0578 5866 

,45 

,2475 

.0247 50 

-.1200 3750 

- .0487 5750 

.46 

2484 

.0198 72 

-.1218 1536 

- 0393 6246 

.47 

.2491 

,0149 46 

- 1232 0486 

- 0297 3058 

.48 

.2496 

.0099 84 

- 1242 0096 

- .0199 2008 

.49 

.2499 

.0049 98 

- .1248 0006 

- ,0099 9000 


imm 

.0000 00 

-.1250 0000 

.0000 0000 
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Section 3. Approximate expressions fot the semi-invariants 
of in samples from the compound frequency 

function . 

In the paper of C. C. Craig, aheady etted, the author obtained 
the following results for sampling from a single parent population. 

(1) Expressions 1 for the sampling characteristics of the correla- 
tion functions for , and 'b , in terms of N , the 

size of the sample, and the characteristics of the population 
itself. 

(2) Expressions' 2 for the sampling characteristics of the distribu- 
tion functions for , and in terms of certain “g” 

functions, the latter being defined by 


( 21 ) 


9k £ 


(V sj )= 

vm 1 n , km+tn 

\4 £ — ~ 


in which 5^^ (v^, vj^ ) are the characteristics of the corre- 
lation function for ♦ 

I can now make use of the results indicated above, in conjunc- 
tion with the relations (6) of the present paper, in order to deter- 
mine approximate expressions for the semi-invariants of , 

0^ , in samples from the compound frequency function, retaining 
only terms of order ~2 and higher in N in using expressions (1), 

and only those “g” functions in using the expressions (2) which 

♦ 1 

are of order - 2 and higher in N , where cj ^ is of order — 

A. The semi-invariants of oA , viz, h , b_ , b, , etc. 

& 1 4 0 

From definition (21) and the relations (1) for m-2, n* 3 , 
and making use of the following notation 


1 Loc. Cit p. 57 ct seq, 

2 Loc. Cit. p, 50 et seq. 



Ci.Oh'Li: VII'PLU ON KNOWN 


(22) <P.: 





3 IN 


< \r 


in which the are the same as in (6) of Section 1 ? 1 obtain the 
following set of <r g’* functions 

fco'fi { (N_1) } m,n « fo1 
%i -^[CN-1)(N-2)<P 3 J 

% "n-»[ (N ' 1)2<P4+ 

^[(N-i) z (N-Z)(p s+ 6NfN-lXN 2)(p,j 

(23) , , 

g- oz -P {fN-D^N-^^^NCN-OCN-Z) 2 ^^ 

9N(N-l)(M-£) 2 Cp 3 :l 4-6H z (N-lXN-2)]. 

<J M - i, J(N-t) 3 cP 6 +12 N(N-l) \+A - N ( N-iXN^j^+eN^N-ll 

^^ 6 [C^) 3 rN-2)<P 7 + 16 N(N- 1 ) 2 C'N-2)<p 5 

+i2N(N-l)(N-2X2M-3)(p 4 .3+46NXN-i)(N-2)4]- 

= ^ 7 [cN-l^M^+aNflN-O^N-Z) 2 ^ 

+ 6N (N-l)(N-2) 2 fe[S'll)4 ) 5 3 + S N (N-l) (M-2) 2 (3IN-5)(f 

+18 M 2 CrM-l) rN- z) Cb in -11 ) 4S N Z 'CN-lX0-2)C9 N-20) ^ 

+36N 3 ('lN-lXN-Z)| 
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9b3= fia jifM-i) ^IS-z) 3 <P 9 + 27 rt(N-i) Z fN-&) 3 $ 7 V27N (H-l)(H-2)Y3N-4)(| 3 
+27lN(l1-l)(N-2) 3 (4-^'/j<P 54 ,+ 54- li z ( hl-l)(N-Z) z (4r/- 1) (f 5 

+162 N YN-lXN-2) z f5N-12)(j) 43 t36N 7 N -30N+34>f 

+10<3N YN-lX^)(5N-i2)(P 3 1 . 


and on substituting these values for the “g”s in the expressions 
for the semi-invariants of o( 3 from the relations (2), I get: — 
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-513 ft -ifft - 810 V \ ft -loeft 6 -f ft 5 +1710 ft 3 


9 1) Q9ij)2i &Ij\ 27 j] 243 A3i 99 j)12 
8 + r%5 + 2-%6 + I^fa + y%4 + 


ft -0 

In a similar manner I obtain 
B, The semi-invariants of oC^ f viz c ± , c £ , c 3 , etc. 

In this case m=*2, n-A- , and the “g” functions this tune are 


' larhTevit y%r%t( v z'^) 
%t ^[fNrNU(N.6)4) 4+ 3N^2rt + l) . 

^ 1 =^[(N-^ fa+ 2(7N-25)cJ) 4t 6(N-5;cp; + lZ(N-2)] 

(24) ? o * = i ! f 4 ( N ' 8 H + 4 (™' 46 ^ 6 +48 ^' fl) ^ +2 ^ l7r< ' 12 ^C 

+ 1 2 f 17 H - 5 6) cj)^ 72 N - 2 0) (}) 3 Z + 2 4 (4- IS - 1 3 ) ] 

^l = i. }cj) a+ 26<P b + 40Cp i5+ 34<p A +1764^ 144<j>/+7z] 

f v 44 v 56 v 2oo< ^ +116 596 C 
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+20 16 Cp z35 + 1512 <p“+ 1584 <p“+ 2832 cp^+4176 <P 2 f+ 768 <J 5 j 

^[V 66 lx +2 ^ 

+ 73 4 4 (j^ 37 + 1 37 76 $ 24 fe + 7548 6280^+25344 

+5672 <^+11446 ^+73440^+51048 <j ^+ 119 232 ^ 

+ 1245 6 <p*+ 49248 cj£*+ 11059 2 <(£*+ 9504 <$* J 

Therefore, substituting from (24) into the expressions for 

the semi-invariants of oC 4 given bv the relations (2) we have — 

4- 

+ \fai)+k (-174-8^-9^-169^-48^)^(1299-9^ 

1 N 

'92 4^,+ 4Q9 <^(+ 15 4^ 3 - 4Z0 <^ 2 - 360 <$ 35 + 272 (j^+1341 ^+44^) ■ 
V-^(l28+64(J)J+^ (-1668+72^-462^-1256^+288^)] 
+ 2°V (-384 -Z2<P b -Z0 408 - 132 <p * ) + ^ (5664-35 <P fl 

-mi c}> 6 -t- 280 0 6600 4^-lZOOcp^- 1400 <P 35 +1464- 

+ 240(p 46 +66<p 4 3 )] + |^(l320+7(j) e +244(p & +494(l) 4 2 + 4^+2376^ 
+1800 4> + 336<f> 5 +16cj^+96<j)f )+ i (- 3 2208 - 1404> fl - 6544 <f> 6 
-2440(j) 4 - 37192^- 7500O(f) 4 -45168(f) 3 i -7O564) 35 + 704 <P 26 
« 896 <P a7 - 532 4)^+ 1856 <P/+ 953 6 <Jj£+32256<P a5 +24192<|^ 
-584(^il6(j)^+45312^V66816(P/ } +lZ288^ 5 -36(P 40 -184O(| 45 
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( 25 ) 
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-80(^-28804)* 11920^-960^-34.^- 504^)' 

^ f- 208o+ 6H- 648(^-1824^, 256cf> z ) 

+3 \ (7488 - 28 4^-16(^+4608 <|) 2 + 10592(j) 4+ 288^ 

-HZO(J) 55 +360(j) 4fa+ 2176(j^ 1 )+3^ (-28096+ \Z<$_ -08<P Q 

-53U$ b -Z15l 4> 4 -38776(f -67872 (j> r 3772a4 ) 3 Z -35Z6 <f£ 
+5Z8<f) 2S +672 (p 37 -2008^+ 1392 (^+9216^+ 7152 ^ 
+24192^^+18144^-108004^+33984 4>f + 501 1 2 (j^ 2 
-4416 (f> 35 + 7152^484^-2368^-98(^-11764)^ 

- ^0 4^' 480 4^ Z j+(. 14912 - 54^.- 18 4^. -330 c^- 6 4^ y - 10404^ 

-1332£p 4S -384O(p 57 -160<^-73eO^-3672O(P 287 -6808O 4^ 

-39240 <j)£-lZZM<{£-7Z5 76^+ 23120(^-57240 “ 

-367200 (p 2 Z “- 255240 <j£-596160 <^+35568^246240 
-552960(P 2 3 3 Z -475Z04) z b -7924) 26 -l006(P 37 +47064({ 6 -2088 4 ) 5 Z 
-10728 (^f- 36 288^ J5 -27216(P 24 + 330480 (g*-50976 <j£ l 
- 75168^ -13824(f> 5 - 264 4^- 336(^+3048 <£- 696 <pj* 
-3576(P^-12096(P 2345 -9072^174Z4(P“-16992<£ 2 
- 250564^-4608 +72 <P 66 +4324^g + i008 (() 0 +38l6O4 ) b 

+3456(p 35b +207364) 3 5 +48384(P 35 +305496(P 4 Z +362304(? 4 
+ 277344^ 2 +24-(p 43 +1152^+42(g+15l2(p 34 V504^8lb(? 4 )]. 
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C. The semi-invai iants. of 0^(=V^,) > viz. ct^ , , dj , etc. 

Now c^ = and g. p (\£)= — 1 -~p-- from the 1 elation (21 ) 

Therefore, the “$>” functions here are — 

‘ti.“^{f N ' 1 d < t,lr '“"'O’ 9-r ■ 9v 

-Z)(P*+2(N-1) 

( 26 ) 2 1 


On substituting from (26) into the expiessions foi the semi- 
invariants of eftf from the relations (2) gnes — 



d 4 =0 ' 

Section 4. The case in which the compound frequency func- 
tion may possess non-zero scmi-iniKirianls of all order, s, 

Instead of the components of the compound fiequency func- 
tion being considered normal, I now assume that they may possess 
non-zero semi-invanants as far as the third order, and t again 
derive expressions for the paiameters of the compound m terms 
of the parameters of the components, The method of derivation 
is entirely analogous to that in Section 1, where the components 
werenoimal. The expiessions for the u n , the semi-invariants of 
the compound aie seen to be more complicated in the present case. 
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but this complexity is more apparent than real. In fact, I have 
succeeded in deducing a rather simple general law, by means of 
which,these expressions may be written out, and this law is still 
applicable, even if the two components should possess non-zero 
semi-invariants of higher orders than the third, 

I now write 


(27) e 


= pe 


, zt 

i +i 


1^3' 



b \ i ct 3 ) 
V* 31 


where 


cum.-m 


i ^ 


b 


i> 


c vV 


A. 


r = 


a 

p ’ 


and m , m , d , d , have the same significance as in Section 1. 
i. 2- i. 4 

and X > X , are the third semi-invariants of (j)^, respec- 
L 3 Z 3 

tively, and as before p+q= 1 . Taking logarithms in (27), we 
have 


t 2 , t 3 


L | t+ L ^t-L 3 3.n--" 


(28) 




further 


(29) ty- 


l+re 


4lL? Hl K ' 1 -qy.A t^Ur), 
f K*1 k V y 
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in which 


3 » at + 


bt z ct 3 
i\ + 3! 


The right member of (29), may be put into the form 

<*> k (-lf^q'VkX i J E> J ) 

ES -V 1 S V 

k*i U1 k NW j.o J 0 


t 

Therefore equating corresponding coefficients of — } in the right 
and left members of (28), for n>3, gives 


(30) 


n 

Z L 

Ml U1 


k (-O^qVk 


l J .nl 


c<-!^! <r l...(2!)' S ( : 3!y 


^a)W, 


where the last summation is taken over all values of j , such that 
the following diophantine equations are satisfied. 


(31) 


*+£> + r -j. 

^ + 2(5 + 3^ *n, 


Using (30), I obtain, for n * 1 to 8 inclusive, the first eight 
semi-invariants of the compound as follows, 


L i -m 1 +-acj i , . 
L^o^+c^a^l-cp-t-b). 


A' 3 +3 < lt H';l +C£ ^• 

L.4. ^ ■ : q4 q,^+ ba 2 b + 3 b ^ ^ + 4 ac q,^ . 
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L s = aq' 5 + 10 a a bq^ 15ab 2 ^ + iOa 1 ^ + 10 bcq'^ 

^32) L fo = a\' 6+ 15a 4 bq l g + 45 1 5 b ZOac^ 

+ 60abcq,’ 3 +- lOctq’^. 

L 7 - a'qt*- 21 a 5 bqU 105a 3 b z q g +• 105 ab 3 q'^ 

+ 35 a 4 cq ( 5 + Z10a z bcq^+ 70ac 2 q‘ 3 + 105 b 2 cq 3 

U 6 = a \8 + ^ afeb£ 77 + ^10a 4 ^q' fa 4-4-Z0a 2 b 3 q' 5 
-i- 105 b 4 q,^ + 56a 5 cq^ + 5b0a 3 bcq^ 

+ 280a 2 c 2 qU<S4Oab 2 cq^+ 260 b c x c? . 

in which q ^ sq^-pq, qj=pqq^, etc,, and are in fact the same 

polynomials that occurred in the discussion of the case for normal 
components in Section L 

The expression for the L n in (5) may be put into a form 
similar that that of (30), and then, if we compare these two forms, 
it is obvious that no new polynomials q* will occur, in addition to 
those which appeared in (5), and this would be true however many 
non-zero semi- invariants the two components may have. 

Using the results established for the L n in Section 1, it is 
evident that, if we consider a particular semi-invariant, say L n * 
of (5), the terms in the right member can be readily written down, 
if we determine all the j part partitions of n, where j and n are 
fixed, using the integers 1 and 2 as part magnitudes. Suppose we 
havc^ parts, each equal to 1, and parts, each equal to 2, where 
otf -vffb =* ] and o t-v£{3-n , then such a partition corresponds to a 
term of the type a 4 b^qb (omitting the numerical coefficient), 
the factor a 4 arising from the last summation of (5). which 
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is clearly seen, if the latter be put into the same form as (30). In 
addition, it is to be noted that any j part partition of n will ,be 
unique for this case. 

Now if we consider the case of the present Section , for n>3 
the L n of (30) will be seen to contain, as well as the same terms 
of the corresponding of (5), some additional terms, the latter 
appearing on account of the fact that, since the integer 3 is now 
admitted along with 1 and 2 as a part magnitude, the j part parti- 
tions of n will no longer be unique for every possible value of j . 
These j part partitions of n will, moreover, give rise to terms of 

the type Cl b H C where the relations (31) are satisfied. Fur- 


ther, the total number of terms in a given L n> for n not too large, 
can be readily obtained by making use of the so-called “enumerat- 
ing function, 11 discussed in works on combinatorial analysis, which 
enables one to determine the number of partitions of a given integer 
ri > when the number of parts j , and the part magnitudes 1,2, 3, etc 
are fixed. 

It would appear, from the above discussion that the partition 
method of obtaining the terms of L n could be carried over to the 
most general case, in which the components may possess non-zero 
semi-invariants of all orders. 

I shall now indicate, without going into detail, that, if in the 


expressions (30) for n>3 , I set every q, 1 equal to unity, then 
the become functions of a, b. and c only, which I call p> n , and 
the latter obey a recursion law analogous to the one established for 
the of (10), namely 


P n+i =1 ( ci ''"^$ a , +c $£ ) ) Pn 


where now 


(V 


dt n 



bt ^£+ 3 

2> 3! 


bv definition. 


t=0 
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(34) ^t n [ e ^] =P n( c » b+ct ,a+-bl+^) e^ (t) ( 

where P n y, z) is an rvth degree polynomial in x, y, and z. 
Again, it is readily seen that 


,#) d 


dt n 


P n (c, b-vct, a + bt + ~ ) 


(35) 


t=0 


~( b f a * c di>)1 


P n (c, b+ct, a-+bt+ )■ e 


t=0 


and that 


<*> V n^j e<P(t) 


t=0 


a-V n -e 


cp(t) 


t-0 * 


Now deriving the left member of (34), with respect tot, and then 

setting t -0 gives the next j5 , viz, > whilst the derivative of 

the right member of (34), then setting t=^0 , would equal the 
sum of the right members of (35) and (36), and thus the law in 
this case is established. 

It is at once apparent that the recursion formulae of (13) 
and (33) may be generalized, so that, if the two components should 
possess non-zero semi-invariants of all orders, the law for the |3 n 
would then be 
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a, b, c, d, etc. being the differences between the 1st, 2nd, etc. semi- 
invariants of the two components, respectively. Thus it appears 
that the actual writing down of the expressions for the parameters 
of a compound frequency function in terms of the parameters 
of its two components may be reduced to a partititon process, and a 
taking of derivatives. 

Section 5. The limiting compound frequency function , when 
the number of components is allozved to become indefinitely large. 

It is to be noted that if the compound is assumed to be com- 
posed of a greater number of components than two, then the 
mathematical development becomes heavy, but a rather interesting 
case arises, when we consider the form of the limiting compound 
frequency function, when its components, infinite in number, and 
identical in form, each contribute the same proportion to the total 
frequency of the compound, and have their means distributed ac- 
cording to the known frequency law f (x). 

First of all I consider the compound to be composed of a finite 
number of components, say Mt-1 , of the type indicated, and later 
pass to the limit, allowing M to become indefinitely large. 

I write now 






i * 

in which for all l « 1, 2, 3,. « . • • etc, and A 

etc, are the 1st, 2nd, etc. semi-invariants respectively of the i-th 
component. The right member of (37) may be written 
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in which rn^is the mean e c some component and m • is the mean 
of the L -v-l)st component. 

If we now assume that m^= , then 

D L i t+L ZXl. + L 3"3i + '■ 


(38) 


= e 


t 2 l 5, 

\l* +A 133!" 


1 

MU 


ml 

1+E e 


m t t 


and the right member of the last relation is the generating function 
for the moments of the compound frequency function. Allowing 
M to become indefinitely large, we have 


lim 


A- 

M+l 



m-ft 


9 u iTl a"t 

+ e ’< 


e m M+i t + 


00 

j ■ e f (x) cbs *Gr x CO. 

J-oa 



Therefore the limit of the generating function (the right member 
of (38) is given by 


+1 , t* 




so that the semi-invariants L n 
function are given by 

fl -t 3 

e L 1 ttL z .^+L 3 3i + '‘ 


of the limiting compound frequency 


±1 

• A U 2i + A i3 3 ! +---- 

= e • G*(t) . 
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From this last relation, it will be seen that the mean of the limiting 
compound is equal to the mean of the means of the components. 
Further, if the means of the components are normally distributed, 
and the components themselves are normal, then the limiting com- 
pound frequency function is also normal. More generally, if the 
components are normal, and their means follow any frequency 
law ffr), then the limiting compound function also follows this 
same law. If now, considering the most general case of all, where 
the components may have non-zero semi-invariants of all orders, 
and the means of the components are distributed according to the 
frequency law F ifrj, then the semi-invariants of the limiting com- 
pound frequency function may always be calculated, and will be 
given by 

in which L. , , are the k-ih semi-invariants of the limiting 

K IK K 

compound function, of one of the components, and of F(St)respec~ 
tively. 

This shows that the variate z of the limiting compound fre- 
quency function, is distributed as if it were the sum of two inde- 
pendent variates, one of which is distributed according to the law 
of the means, and the other according to one of the components. 
To write down the actual distribution function for the limiting 
compound is quite another matter, but since we may write, in the 
limit, when 


t l i * 

L 1 -t^L z j,+-L 3 -5,+- 


> 1 %. 
12 Z'. 13 3! r 




then, the distribution function sought, provided it fulfills the neces- 
sary conditions, may be given formally by means of the Fourier 
Integral Theorem. 
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PART 2. 

As indicated previously, in the introduction to this paper, we 
are concerned in this second part with an entirely new problem, 
in which we are now sampling from two distinct parent populations 
instead of from only one, as in Part 1. Hence, in order to obtain 
'the desired sampling results, we must have recourse to an entirely 
different method of treatment from any we have made use of here- 
tofore. I shall suppose that the two parent popula- 

tions may possess non-zero semi-invariants of all orders, and that a 
random sample of r is taken from the first population, and a ran- 
dom sample of s from the second population, these two samples 
being then combined to give the composite sample from the com- 
bined populations. 

Section 6. The semi-invariants of “moments about a fixed 
point ” in samples from the compound frequency function . 

With the above hypotheses, I shall derive in this section, expres- 
sions for the semi-invariants of “moments about a fixed point” in 
samples from the compound population. 

Calling the required semi-invariants 5^ , 5^ , 5^ . arc, we 
have, by definition 

t 2 t 5 

5 l t + 5 2T! +-S 3 3! + "' 

e 


(39) 




-/ /WxMT&W]' 


n E+S n 

l-l j*r+l 


Vr 5 


in which (() (x) , ty z Q0 are the initial parent frequency functions, 
and X • , X , indicates that the variate was taken from the first 
and second parent respectively. By a suitable transformation of the 



324 ON SAMPLING FROM COMPOUND POPULATIONS 

parameter in the power of the exponential which appears in the 
right member of (39), this same member may lx? put into the re- 
quired form 



t k 

On Equating corresponding coefficients of in the last ex- 
pression and the left member of (39), I get 


<* fj 3 . rk - ) +sk -s K K) 

KUJ ~ (r + 3 ) k 


in which ^ (v£), 3j(v'| ), 5^ (v'J ) , etc., and 5^ (\Z'), 


5 3 (v^ 1 ) , etc., are the 1st, 2nd, 3rd, etc., semi-invariants for 
in samples from the two component populations andC^fc), 

respectively, the values of which are well known. 1 


Section 7. The semi-invariants of “ moments about the mean }i 
in samples from the compound frequency function . 

Employing the same sampling procedure as in the last section, 
I wish now to consider the semi-invariants of “moments in sam- 
ples from the combined population, about the mean of the com- 
bined sample” and to express them m terms of , and (5 ^ , 

(5 Z , (the semi-invariants if the component distributions 
respectively) , and r and 3 . 

In order to obtain the desired results, I have made use of a 


1 Loc. Cit. pp. 12-13, 
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modification and extension of a method originally employed by 
C. C. Craig 1 for the case of sampling from one normal parent pop- 
ulation. I shall first develop the theory for my case, on the basis 
that the two parent populations may possess non-zero semi-invari- 
ants of all orders, imposing the condition of normality only when 
actually computing the desired results. The mean of the combined 
sample is 



which 5: ~ and r-hs~N , for particular values of n, 

and for infinitely many sets of r+s variates, assuming that each 
member of each set is independent of all the rest. TheN , 6 s in each 

N 

set satisfy TL 6 ^ - O . 

Now, let F (6 6 “-6 )be the correlation function of the first 
N-l 6 'S. ThenF( 6 1 , 6 J • 6 ^) 016 ^ 6 ^ dq^ gives the probability 
that the first N-l 6 s fall simultaneously within a cell 

rVxcLS^ S±\ d6 2 , . Wl d6^ x ) 

The semi-invariants of F (<5 t , 6 ^ • • • 6 N _^are defined by 



1 Loc. Cit. pp. 1 to 35. 
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where e.g, 


/ 2 , , 
^ (&W = ^ 20 ^ 1 + ^ + ^OZ^z: 


Setting 


we have 


N 


6.- = 23 CL ; 56: 
L J-l ‘■J J 


a 

a 


1 • , . . 
Lj “ ” N ’ 

. = 

Lu n 



e (l^cH r.+l't'v) + 

^gwi)4(&w t 

€» 
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m which c/ 1 ,* z , etc , are the 1st. 2nd, etc. semi-imaiiants of the 
first component and etc. are the correspond, >,« 

semi-invariants for the second component $„(*). It follows then 
that 


iWki 


'N-i 


N-l 


N-l 


N-l 


- *(swi V< 


or 

A,. 


\L*1 
^4 k* k 


A .. «&& 

* ft kp*l 0 2^r*i' a »l?rtl* a i,r*2 a J, 2 rt2' a N'Slr« 


N-l I 


..hi 




+ a i,r+S a z, r+s’ a M-l j r+s 


where k^+ k_ + • • • + k |s j_ 1 * k 

so that 

x _J fr-aV 2 a kf1 ' 1 ^ 

Vz j,.l ‘1 2j- ' a N-l,j 


rns 


^k ) ‘^ 1 4 a ii a 


k i *2 *N-1 


- ‘Vi.j 
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By substituting for the a’s from the relation (41) in the last equa- 
tion, the latter may be reduced to the following convenient form, 

where, now, , provided that, 

at least one of the k- u in k , . . .^is equal to zero. 


1=1 



(42) 





In the case of one parent population, all ?\. ,, k of the 

same type, i,e. whose subscripts k^., . are merely different 
permutations of the same set of integers , k 2 , , . M , were 
equal This is no longer true for two parent populations, for we 
must now distinguish between the 6 s which arise from observa- 
tions from the population and those from I therefore 

introduce, at this point, what I shall call the “bar notation”. For 
example, from the relation (42), all |k l< r ^will 

of the same type, and therefore equal, if the first r subscripts are 
merely different permutations of the same set of integers k r 

whilst, quite apart from the first r subscripts, the last 3 subscripts 
are also different permutations of the same set of integers k r+ ^, 
k r+ £ . . . k r+3 . In writing down the semi^invariants of the cor- 
relation function F , using the “bar notation", for convenience, I 


shall suppress zero subscripts. Further, on account of the relation 

(42), alU' k 1 K z ...k r jK r+l ..K m for which k l + V" k r +W r+l + - fk r«^ 
will vanish, since we are assuming now, that our two parent popula- 


tions are normal As a matter of fact, the only 
that I shall require here are the following, 
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\|0 N { ^ ^ij • 

A 0|l " N j ^ ^lj ' 

A 2|0 = ^{ r V s ^ + N ^' 2 )^]- 

A 0|Z = H 5 -j ro< 'z +s ^z + N(N-Z)^ x . 
\l(0 +3 Px“ 

A 0|11 = NM r ^ ^ S £V 2N A* • 

= r^| p ° < '& *■ S A&~ ^ ( D ^z, 4 '^)| ■ 


The above expressions were obtained from ( 42 ), after assuming, 

(without loss of generality,) that , + s(3 =0 . If, instead of 

this last assumption, I had assumed that or A^ were equal 

to zero, not only would the symmetry of the final results have been 
destroyed, but the amount of labour necessary to obtain them would 
also have been doubled. The symmetric substitution actually made, 
required that only half the final number of terms be obtained, the 
remaining half in any particular result being readily written down 
by interchanging the cC's and (3's as well as r and s . ^ g n 

Now, let P(V^)be the probability function for 2 ~~ 
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The semi-invariants ot P(v£) are then defined by 



Regarding the use of F(6 Ai 6^,* • 6^) instead of \r(5^6£ } ' & ^ 

in the above relation, see paper by C. C, Craig. 1 

We wish now to express the semi-invariants in terms of 

the semi-invariants /\, v ^ u- , of the correlation function 

K N 

F (6 4l & 9 ‘ * • 6.,) for the 6s. The semi-in valiants L _l 
‘ 1'4 N ' rsx ... 

of the correlation function for 6^ are defined by 



by expansion of the exponential function. Then, comparing the 
1 Loc, Cit, pp. 18 to 19 
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(45) S = 


kl 


• L. 


K N k i k x ! ' ' V- Klkr ' Kr 

in which the summation is taken over all values of K k U 
such that 

k t +k z +- • + k p » k . 

Making use of the explicit relations for semi-invariants in terms 
of moments and vice versa, we have from (44) and (40) 


(i Wif 


(46) -EE. -E 


w'lwtcif- 

C(£/n l t i ) la> ] r [(5 1 v'„ i 4 t ) Cb, ]k 

r! s' 41 


(47) 


/ N 

\ U1 L 1 


(n) 


= ££ .x 


(a! ) r (bl) s (cl) t . . . . Haiti 


where, in both these relations 


a > b > c * 


and 


ar + bs + ct + * ^ n 


From the relations (46) and (47), the L n can be found in terms 
of the moments of F , and these, in turn, can be found in terms 
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of the semi-invariants of F , by equating the coefficients of like 
powers of the 1 5 on both sides of the two equations. Examples 
of the kind of relations obtained from (46) and (47) in particular 
cases would be as follows, 



Therefore 


L -V V 

210... O 4-20. O AO...O 020. o 


'^ZZO.. Q^20. ,0+Ho o'ozo .0. 


/M /N f N ^( 4 ) / M 


Vi 




(*) 


N \0)1 


+ io l(f i ¥ i 


N \(2)1 3 

+ 15Hg/ili 


Therefore 


izo 0~ ^420 . 0 + ^40 o'^OZO .o\l o. -0 

0 ^ 10 . - 0 + ^210 . o * ^^30 -O ^izo. . o 
4 ' 3 ^ZO . o' ^OZ0.,,O + ^^ 20 .. 0' ^ iio.,.0 * 


In m> work, I actually make use of the following relations obtained 
from (47), with certain terms omitted, which vanish when each of 
the parent distributions is normal. 
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(,) (SM-teW 
(i,) 

(m (s# ik4\k*4lk\\f . 


N \(4) f/M 


2 /h \(z)r/N 


(iv) -W7MF.7V WfeA‘i)J- 

+ 4WtfiiW*[( ! Ei 5, A)] 5 


n \(6) r/N \Wi 2 r/N 


(vl) (fi'tv - 45 K6^ 


£At, 

l-l 1 1 




N . \(7) 


,(i) 1 2 [ 7 n 


' io wv j 


n \«)r/M 


N \(zVl 5 /H 


+a (&w M* 10 W J (sawmjj. 


N \(7)l Z r/H 


N \ft) 


(viH> tei^V -“IISAV J I'M ♦“lEAr * 

7n 7M \ (7)7 J r/N \l z f/N NUTf f/N M® 

MHp J MHsw ]*M- 


By substituting the expressions for the L’s given in (46), into the 
right member of (45), a direct expression for the S^'s in terms 
of the moments of F is obtained, viz. 
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1 K! 

K n k K rz- K r k k> ... kl 

l 2. r 


n k 


N 

£ L: 
1*1 1 


(k) 


omitting the parameters , and in 



(46). From relation (43) to this point, the theory follows exactly 
that given in the paper already cited. 

I shall next quote the final expressions A for some of the 5^(v u ) 
obtained by C. C. Craig, 1 in terms of the moments of F , for the 
case of one parent population, and then I shall write down the mod- 
ified expressions B , when two parent populations are involved. 


A - . 

2> - N* j N ^n,0 + N^K.n.o" N ^n , o ] . 

B . 1 . | r ^ n | 0 +5V, 0 ^ n | 



rvi 


£n 0 


* W oUr^< r - 1 K,n|o*=( 5 - 1 K ln _ n 




n 




o +sV oln +2r5v/ * 


nlo" v o\nl' 


In the paper mentioned above, expressions were also derived, using 
a method similar to the one already indicated, for the semi-invari- 
ants of the correlation function of two moments about the mean. 
I have made use of a modification of only one of these expressions 
as follows, 


1 Loc. Cit. p. 22. 
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A * ) NV m+n ) O +N ^"0^ m) n ) o“ NZ ^ jO N n,c 


B - 5 iAm/n ) = isji | rv m+n|o + sv o|m + n + ^ nlo 


s ( 5 ’^ v o| rn ,n + r 3 *^11 n + r 5 ^ 


n m 


rV m|o^n|o +s2 ^|m N/ oln +r5v/ m|o v/ o|n +rsv/ o|m^n|o 


if ere again, for the moments of the correlation function, I 
tniploy the “bar notation”, its meaning being exactly the same as 
in the case of the k r |K r+ ^ k r45 J di scuss i°n regarding 

identical types of moments and their equality, corresponding also 
in every detail. Once more, zero subscripts are suppressed. 

It now becomes necessary to express the modified moments in 
the expressions B , for particular values of m and n, in terms of 

the \ , of the correlation function F . 

ri' * K r 1 K r+1 * k r + s 

To this end, I make use of the relations (48), in conjunction with 
the so-called “D s operator of Hammond” 1 which splits off a total 
integral part s , made up by addition from any or all of a permu- 
tation of integers. 

At this point also, it is necessary to modify somewhat the use 
,ii the Dc operator, because of the “bar notation” used to designate 
i he moments and the semi-invariants of the correlation function F , 
*\ hen two parent populations are being considered In making up 
she total integral part 3, split off from the permutation of integers 


1 MacMahon — Combinatorial Analysis, Vol. I, p 27 



336 ON SAMPLING FROM COMPOUND POPULATIONS 

k i k 2) , <( W r | K r+i . .k p45 , the parts split off from the set of 

integers k^k 2 * • k r must be kept distinct from those parts 

which are split off from the set of integers k r+ j, k r+2 ,.. k p+3> and 

this same rule applies also to the residual permutations from each 
of these two sets, after all the parts, with sum 5 have been finally 
split off. Hence, the use of the “bar notation” to effect this dis- 
tinction. To illustrate exactly what is meant here, suppose that I 

wish to express in terms of K rH _ s * * n 

case, I shall use the relation (v) of the set of equations (48), and I 
shall merely consider the contribution made by the second term in 
the right member of (v) to the final expression for , the other 
terms of (v) being treated in a similar manner. Now (omit- 
ting a numerical factor) is the coefficient of in the left mem- 

ber of (v). I therefore seek the corresponding coefficient of 
in the second term of the right member of (v) this term being 

N k(Z) 

. Using the modified form of theD s oper- 
ator, we have 

DjD*(3|z)-(2|0)Ei(i|zM0|z)D*(3|0> (i|i)d*(zU) 

.3(z|oXi|oXo|l)"*(o|2)(l|o)+3(l|l)(l|o)to|l) 

Now, we are able to write down immediately the terms in 
'•'vkr+i' •• ^r-v -3 which arise. They are 

(49) 3 \|o A i|o ? 'o|i + - ;> 'o|2.^1|o + - 5; 'l|i ;> 'i|o/o|t 
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Ordinarily, the numerical coefficients in (49), will need to be mul- 
tiplied by an integral factor, obtained as follows* A term t* \\ may 

/ N \ (s) q’l 

be chosen from the expansion of / 2L ) in -or 10 (in 


general, in C^) ways. The numerical coefficient of the second term 
in the right member of (v) is also 10 (and in general, is, say C z ), 
The required factor for the above example is unity (or, in general, 


the quotient — . It should be noticed in addition, that the sum 
C i 

of the coefficients in the final expression (49) should equal the 
numerical coefficient of the second term in the right member of (v) , 
with which we started, and this is seen to be the case. 

As a check, one may observe, that if in the results which I 


obtain for 5^ (v m y n ), the two normal parent populations 

are identified, then the results for a single normal parent population 
are obtained. Note that, to get the results as usually given for a 
single normal parent population, one would further have to set 

fix ™ O* 


I derived the following results, which have been checked by 
calculating the corresponding results for a single normal parent 
population, without assuming that the first order semi-invariants 
of the type ^ l0Q , . . , o0 are e 9 ual t0 zero< 


s i(vJ (N-i Xr<V 3 spf ) - 

v 

3 4 (^)- L ( 3(N-Z)(r^+ 3(5 t ^)+N(r^ 3 + s #) ■ 


3 jv 4 )=i3 |3Hr| N(N-2A r (2N-3) J ^3Ns j N(N-z)ls(2N-3)] 


ff 
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♦N'VJi- 5p 4 ) + 6N Z [r(N-ZN+r)^\ + 3(N 2 - 2N + s)(5 2 0.,] 

► [dNra (ZN-3)]^ ^+6 N 2 r s [*J |3 Z ] ■ 


S^) = ^- 4 jr(N-ZNt r )^ + 3(N 2 -2N + s )p^Zr 5 [( r ^ ( 3 ;l+S ^j3 i J ') 

+ *zA> + ( s *^ r 8 i A:)' z ( s ¥if’i + r *i PiAt)] ■’• 

i z (v 3 ) = ~ b |6r[N -N%NV's(r +5 Ns)^ 1 j U^+Gs[n 4 N 3 + nV 

-r (s z +5 N r)j ^ f> 2 + SN^rs [(s«c£< t + r A 4 /5 z )+(n* 4 A z + sK z&*) 
-Z( r* z flf 7,, )] t-3r3 (3j,+ ) 

-t-Srs^** PiA 2 )-2 js(N z -2N-r)* z & z + r(N -ZN-s)* 2 £3 Z J 
+ Z |[n (b-M)(r- s)+ 2r z ] »z^^z + (^'N)(s-r)+2a z ’]»« 1 2 '«t 2 /3 Z j 

+ {[r(ZO-IZN + 3N Z ) - N(zrN Z -10N-<-i6)]p<. z /3 2+ [s(ZO-i2N- ) .3N z ) 
-N(2N 2 -10N+lb)]^/3 2 ] +r[r z (3N 2 12N+20)-Nr(6N-30N+48) 
+N 2 (5N z -24N+32)]°t/ +s |s 2 (3N z -12N+20)-N.s(6N-.30N+4£) 
+N Z (5N-24N<-3z] /3, 3 I. 


J2 (v 4 )=-^ j31N Z r(7N 4 -Z0N 3 -aMr+38N 2 r + 4N V- 28 Nr+ 7r% 4 


-ON 5 (7N 4 ZON -8N s +38 Ns + 4 NV- Z<9 Ns z + 7 s 9 ) (£ & 


+ 2N 4 rs (s<Z^ 2 +rp2<3 2 )-» l r 8 Z + S"f- Z j - 2(s«^<j, (3* 

+ r^^> z )] +3N z r V A>z 

+ ^V 2 ^) + 12^%^)] +18N z rs (s^y 2 $+ r^|&* g ) 
+ lZN Z rs (N-NV'Zs 2 )*^^* A2 + (N 2 -N z s-Zr 2 )<>yt Ag 
+6N Z rs (5N z +ZN z r-14Nr +7r 2 )^ iZ /3 2 +(5N 2 +ZN 2 s 
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- 14Ns+ 7s z ) (5^ (3 z ^4-iZr5 (-2N -3N rs+ 12 N^r 
-15Nr z +5r 3 )<< 1 ^/5 i + (-ZN -3N rs + 12 N z s -15Ns 2 
tSs 3 )^^!^ +12N Z r5 s(-N +2N-r)* J *J/S! l 3 + r(-N Z 

+ 2 N - s ) ^ ^ ^ J + 1) rs "(3 N Z - 6 N V 1 5 N 5 - 1 2 N s H 3 5 3 
+8 rs 2 )*l pi + (3N V- 6 N \ 15 N 2 r - 12 N r z +3 r 3 

f“ 

4 Qrh)«lpl 4- 36 rs (~5rs z - NV N V+3 N 3 s -2N 3 
1 3 N 2 r)^ ^ ^ |9 Z + (■ 5 r z a - N 4 r - N 2 s z +3N 3 r -ZN 3 
4- 3N 2 ' s )°<'^ z p i f >2 j + 9rs (9N 3 -22 N z - 36Nrs472rs 
4 6N z 'rs) D 4 2 ^ 2 +18rs[f5r Z 4-7N 2 r 2 '-14NV4-N 4 r 4<9N 4 

- 29 N r \ 43 N 2 r - 21 N 3 )^* 2 £> z + (5s VrNV- 1 4|\| 3 s 

4- N s + 8 N 4 - 29 N s Z + 4 3 N 2 s - 21 N 3 ) ^ |3 Z ] 4 18 rs (n 3 

+ 4N z s+5N 2 rs 4-5r z s -^Nrs)^ 2 -^ (^ 2 4-^N 3 +4N z r 
+5N rs + 5rs z - (9Nrs)°4^ (3^ /3 z "j43r(36r 3 18Nr 3 

4-75 N 2 r 2 4- 3 N V 3 - 12 N 3 r 2 126 N V4-28N 4 r 4-84 N 4 

- 32 N 5 + 4N fa - 126 Nr Z +I62 N 2 r -78N 3 ) -cj 4- 3 s (36 s 3 
-16NsVT5 NV+ 3 N 2 3' j - 12 N*s*-lZ6 N 3 s 4-28N s +84N 4 , 
-3ZN' 5 +4M 6 -l26 M 2 + 162 N^s -78 N 3 ) p2+ 6r (5 r 4 39Nr 3 
4 102N 2 r z -f3N 4 r Z 49NV-36N 3 r z -121 N 3 r4-57Nr 
-5Kr+60N 4 -4ZN S +8N t )^ 1 Z ^ z 4fes (5s 4 -39 Ns 3 
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+ 102N V+ 3 N V+ 9 N V- 36 N 3 s z - 4.21 N s + 57 N 4 s 
- 6 N S 3 + 60 N 4 - 4- Z N 8 N ) (3^)3^ 6 rs (72 r % 66 N z - 5 J 
+75 N *7 - 36 Nr z -lZ6 Nr + 6 N z r z + 11 N 4 - 1 2 N (3 

+6rs(72s z + 66N Z -51N 3 + 75N z s - 36Ns z -l26Ns + 6N V 
+ HN 4 -12N i s )*$ 

3l 1 (v 2) v 3 )* 1 | jr[^(N-3)+N(N 2 -N-s)]^ +S [r*(N-3) 

+N(N Z N-r)]p 4 ^+ rs[s(3-N)<+ Z |9 i+r (3-N^ 1 p 2 2 ] 

+ r sjj(N - 2)(r-s) + Zs] ^ £> z +j(N-Z)(s -r) +2 r ] Q /9j 
+ Nrs[( 3 ^ V r^/3 (3> z ) 

- (r^ z ^ P 2 + *V Z Pi /V )] • 

3^^. 1 3 r(13 M z r-7N 5 + 2 N V z + 3N 4 4-N V + Z r 5 

-9Nr z -)^ Z oC Z +3s(l3N i s-7N 3 +2N 5 2 +3N 4 -4N 3 5 
+•25 -9 Ns 2 ) p^p* + 3 r (6N*+ N 4 + 3 r z - 5 N 3 - Nr z + 4 N r 
•BNr)^4 3s(6N Z +N 4 + 3a z - 5 N 3 - Ns z +4 N.(s - 8 N s) p 3 
i-Orsfer^Nr+N'r + ^N^N 3 )^/^ +6rs (Zs 2 - 

- 4 Ns + H 2 s + Z N z - N 3 )^ P a (3 2 + 2N rs F(s*Jv r fe) 

- r^p 3 pj -( 3 Vz p* + r^p 2 ) 
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+ (r*^ 0 z +9 °^(\ )J + 3rs Z(A-rs-^)^ z ^ x 0 z * 2(Zr-5NJr 
+ Z N K N V) p z + Z ( Z s a - 5 N * ^ ZN *•+ H z s )* z £? £> z 

+(N 2 - ZNs+ars + s 2 )^ 2 ^ 2 -*- (lM-ZNrh5rs v-r 2 )*. 2 / 3 2 
+ (9 r - QN 4- 4N 2 - 3 N r ) + (9 s <3H c 4- IS] 2 - 3N s )<^ $ j 

^i(^,v 4 ) » ^ ||v| r-(6 r 5 - 2 b N r h 37 N V+ 4 N V- 3 N V - Z° N 3 
+ 7 N 4 )^V 2 + N s ( 6 s : 3 - 2 b N i z + 37 IN *s + 4 N V- - 8 N \ 
-Z0N 3 +7N 4 )(3^(3 Z + r(75 IN r z -107 NV + 5£>N 3 - 19 r 3 
+ bNr 3 -30NV 2 '4 54 N r + 3N 3 r 2 - bN*r-4-Z N + 


4-6i7 S )^2 3 4-3(75Ma 2 - 1G7 N z s +56N 3 -i95 3 + 6 Ns 3 
-30N 2 5 z + 54- N 3 5 + 3NV- 6 nV AZN^+aN 5 )^ 3 
KlSrb[(-N z r4 ZH 2 -2s 2 - f 4.rs)'* i 3 p*+(-N 2 5+2N i -Zr ; ’ 

+4r s)^Ai J 3 rs[( * vy-r^Ma-^ & 

*■ r \ 5< S 0iM r */<V s,c *$] 


+N rs|(-N 5 -+ii N 2 +5N 2 r- lbr 2 - Z&rs) P 2 +f-N 3 
+11 N ; +5N z s- 16s z --Z8r5)“<. z< (3 i i 0 z "j + 3Nr3^(-N 2 3 


+3N 2 -5Nr + Zr z y^^+(- N z r+3\N Z - 5 Ns 

4Z3 2 J^S i pJ] + 3 ^ r5 [(‘ N;lr+NZ - 2N5+4r5 ^i'ijPi^ 
f(N 2 5 4N z -ZNr44r5)^<4 2 (3 1 (3 2 ]+ 3Nrs js(N 


- Z rUy x P ± 4 r (M- 2 & ) < L (3 Z & ] + 3r 6 ( 19 rKll VI r 

Z^ Z rtNr z - ITN r 4 11 N 3 + * 2 N V- &z 
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*3rsf- 195^4-1 Ns -23 INS bN5 z -17N*s +11 N 3 

+ ZN 3 s-N 4 )c< x (\0 Z + 3rs ('-19rs + 6lNi''S +7Ns 
-N‘-Nr-N 2 543N i r)oC. i ^ ( 9| + 3rs('-19 r5 + 6Nrs 
+-7Mr-N Z -Ns-N 3N z s)o<, z + rs (3IN 3 -9N5 
+ 19r? -8Nr +6N z s)o^ i ^+ f3N 3 -9N Z 4l9r5-sr\ls 




